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Abstract 

We provide an assessment of the impact of parton distributions on the determi- 
nation of LHC processes, and of the accuracy with which parton distributions 
(PDFs) can be extracted from data, in particular from current and forthcoming 
HERA experiments. We give an overview of reference LHC processes and 
their associated PDF uncertainties, and study in detail W and Z production at 
the LHC. We discuss the precision which may be obtained from the analysis of 
existing HERA data, tests of consistency of HERA data from different experi- 
ments, and the combination of these data. We determine further improvements 
on PDFs which may be obtained from future HERA data (including measure- 
ments of Fl), and from combining present and future HERA data with present 
and future hadron collider data. We review the current status of knowledge of 
higher (NNLO) QCD corrections to perturbative evolution and deep-inelastic 
scattering, and provide reference results for their impact on paiton evolution, 
and we briefly examine non-perturbative models for parton distributions. We 
discuss the state-of-the art in global parton fits, we assess the impact on them of 
various kinds of data and of theoretical corrections, by providing benchmarks 
of Alekhin and MRST parton distributions and a CTEQ analysis of parton fit 
stability, and we briefly presents proposals for alternative approaches to par- 
ton fitting. We summarize the status of large and small x resummation, by 
providing estimates of the impact of large x resummation on parton fits, and 
a comparison of different approaches to small x resummation, for which we 
also discuss numerical techniques. 
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1 Introduction 

The physics of parton distributions, especially within the context of deep-inelastic scattering (DIS), has 
been an active subject of detailed theoretical and experimental investigations since the origins of pertur- 
bative quantum chromodynamics (QCD), which, thanks to asymptotic freedom, allows one to determine 
perturbatively their scale dependence [1-5]. 

Since the advent of HERA, much progress has been made in determining the Parton Distribution 
Functions (PDFs) of the proton. A good knowledge of the PDFs is vital in order to make predictions 
for both Standard Model and beyond the Standai^d Model processes at hadronic colliders, specifically the 
LHC. Furthermore, PDFs must be known as precisely as possible in order to maximize the discovery po- 
tential for new physics at the LHC. Conversely, LHC data will lead to an imporvement in the knowledge 
of PDFs. 

The main aim of this document is to provide a state-of-the ait assessment of the impact of paiton 
distributions on the determination of LHC processes, and of the accuracy with which parton distributions 
can be extracted from data, in particular current and forthcoming HERA data. 

In Section 2 we will set the stage by providing an overview of relevant LHC processes and a 
discussion of their experimental and theoretical accuracy. In Section 3 we will turn to the experimental 
determination of PDFs, and in particular examine the improvements to be expected from forthcoming 
measurements at HERA, as well as from analysis methods which allow one to combine HERA data with 
each other, and also with data from existing (Tevatron) and forthcoming (LHC) hadron colliders. In 
Section 4 we will discuss the state of the art in the extraction of parton distributions of the data, by first 
reviewing recent progress in higher-order QCD corrections and their impact on the extraction of PDFs, 
and then discussing and comparing the determination of PDFs from global fits. Finally, in Section 5 we 
will summarize the cun^ent status of resummed QCD computations which are not yet used in parton fits, 
but could lead to an improvement in the theoretical precision of PDF determinations. 

Whereas we will aim at summarizing the state of the art, we will also provide several new re- 
sults, benchmarks and predictions, which have been obtained within the framework of the HERA-LHC 
workshop. 
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2 LHC final states and their potential experimental and theoretical accuracies^ 
2.1 Introduction 

Cross section calculations and experimental simulations for many LHC reactions, within the Standard 
Model and for many new physics scenarios have been performed during the last 20 years. These studies 
demonstrate how various final states might eventually be selected above Standard Model backgrounds 
and indicate the potential statistical significance of such measurements. In general, these studies assumed 
that the uncertainties from various sources, like the PDF uncertainties, the experimental uncertainties and 
the various signal and background Monte Carlo simulations will eventually be controlled with uncertain- 
ties small compared to the expected statistical significance. This is the obvious approach for many so 
called discovery channels with clean and easy signatures and relatively small cross sections. 

However, during the last years many new and more complicated signatures, which require more 
sophisticated selection criteria, have been discussed. These studies indicate the possibility to perform 
more ambitious searches for new physics and for precise Standard Model tests, which would increase the 
physics potential of the LHC experiments. Most of these studies concentrate on the statistical significance 
only and potential systematic limitations are rarely discussed. 

In order to close this gap from previous LHC studies, questions related to the systematic limits 
of cross section measurements from PDF uncertainties, from imperfect Standard Model Monte Carlo 
simulations, from various QCD uncertainties and from the efficiency and luminosity uncertainties were 
discussed within the PDF working group of this first HERA-LHC workshop. The goal of the studies 
presented during the subgroup meetings during the 2004/5 HERA LHC workshop provide some answers 
to questions related to these systematic limitations. In particular, we have discussed potential experi- 
mental and theoretical uncertainties for various Standard Model signal cross sections at the LHC. Some 
results on the experimental systematics, on experimental and theoretical uncertainties for the inclusive 
W, Z and for diboson production, especially related to uncertainties from PDF's and from higher order 
QCD calculations are described in the following sections. 

While it was not possible to investigate the consequences for various aspects of the LHC physics 
potential in detail, it is important to keep in mind that many of these Standard Model reactions are 
also important backgrounds in the search for the Higgs and other exotic phenomena. Obviously, the 
consequences from these unavoidable systematic uncertainties need to be investigated in more detail 
during the coming yeai's. 



Subsection coordinator: Michael Dittmar 
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2.2 Measuring and interpreting cross sections at the LHC ° 

The LHC is often called a machine to make discoveries. However, after many years of detailed LHC 
simulations, it seems clear that relatively few signatures exist, which do not involve cross section mea- 
surements for signals and the various backgrounds. Thus, one expects that cross section measurements 
for a large variety of well defined reactions and their interpretation within or perhaps beyond the Standard 
Model will be one of the main task of the LHC physics program. 

While it is relatively easy to estimate the statistical precision of a particular measurement as a func- 
tion of the luminosity, estimates of potential systematic eiTors are much more complicated. Furthmore, 
as almost nobody wants to know about systematic limitations of future experiments, detailed studies are 
not rewarding. Nevertheless, realistic estimates of such systematic errors are relevant, as they might 
allow the LHC community to concentrate their efforts on the areas where current systematic errors, like 
the ones which are related to uncertainties from Parton Distribution Functions (PDF) or the ones from 
missing higher order QCD calculations, can still be improved during the next years. 

In order to address the question of systematics, it is useful to start with the basics of cross section 
measurements. Using some clever criteria a particular signature is separated from the data sample and 
the surviving Nobscrvcd events can be counted. Backgrounds, Nbackground> from various sources have 
to be estimated either using the data or some Monte Carlo estimates. The number of signal events, 
Nsignab is then obtained from the difference. In order to turn this experimental number of signal events 
into a measurement one has to apply a correction for the efficiency. This experimental number can 
now be compared with the product of the theoretical production cross section for the considered process 
and the corresponding Luminosity. For a measurement at a hadron collider, like the LHC, processes 
are calculated on the basis of quark and gluon luminosities which are obtained from the proton-proton 
luminosity "folded" with the PDF's. 

In order to estimate potential systematic en^ors one needs to examine carefully the various ingredi- 
ents to the cross section measurement and their interpretation. First, a measurement can only be as good 
as the impact from of the background uncertainties, which depend on the optimized signal to background 
ratio. Next, the experimental efficiency uncertainty depends on many subdetectors and their actual real 
time performance. While this can only be known exactly from real data, one can use the systematic 
error estimates from previous experiments in order to guess the size of similar eiTor sources for the fu- 
ture LHC experiments. We are furthermore confronted with uncertainties from the PDF's and from the 
proton-proton luminosity. If one considers all these areas as essentially experimental, then one should 
assign uncertainties originating from imperfect knowledge of signal and background cross sections as 
theoretical. 

Before we try to estimate the various systematic errors in the following subsections, we believe 
that it is important to keep in mind that particular studies need not to be much more detailed than the 
largest and limiting uncertainty, coming from either the experimental or the theoretical area. Thus, one 
should not waste too much time, in order to achieve the smalled possible uncertainty in one particular 
area. Instead, one should try first to reduce the most important error sources and if one accepts the "work 
division" between experimental and theoretical contributions, then one should simply try to be just a 
little more accurate than either the theoretical or the experimental colleagues. 

2.2. 1 Guessing experimental systematics for ATLAS and CMS 

In order to guess experimental uncertainties, without doing lengthy and detailed Monte Carlo studies, it 
seems useful to start with some simple and optimistic assumptions about ATLAS and CMS^^. 

First of all, one should assume that both experiments can actually operate as planned in their 
^^Contributing author: Michiael Dittmar 

"'Up to date performance of the ATLAS and CMS detectors and further detailed references can be found on the corresponding 
homepages http://atlas. web . cern . ch/At las/ and http : / / cms info . cern . ch/ Wei come . html/ 
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proposals. As the expected performance goals are rather similar for both detectors the following list of 
measurement capabilities looks as a reasonable first guess. 

• Isolated electrons, muons and photons with a transverse momentum above 20 GeV and a pseu- 
dorapidity r] with \r]\ < 2.5 are measured with excellent accuracy and high (perhaps as large as 
95% for some reactions) "homogeneous" efficiency. Within the pseudo rapidity coverage one can 
assume that experimentalists will perhaps be able, using the large statistics from leptonic W and Z 
decays, to control the efficiency for electrons and muons with a 1% accuracy. For simphcity, one 
can also assume that these events will allow to control measurements with high energy photons to 
a similar accuracy. For theoretical studies one might thus assume that high pt electrons, muons 
and photons and \r]\ < 2.5 are measured with a systematic uncertainty of it 1% for each lepton 
(photon). 

• Jets are much more difficult to measure. Optimistically one could assume that jets can be seen 
with good efficiency and angular accuracy if the jet transverse momentum is larger than 30 GeV 
and if their pseudo rapidity fulfills |r/| < 4.5. The jet energy resolution is not easy to quantify, but 
numbers could be given using some "reasonable" assumptions like AE/E « 100 — 150%/ ^/E. 
For various measurements one want to know the uncertainty of the absolute jet energy scale. Var- 
ious tools, like the decays of W ^ qq in ti events or the photon-jet final state, might be used to 
calibrate either the mean value or the maximum to reasonably good accuracy. We believe that only 
detailed studies of the particular signature will allow a quantitative estimate of the uncertainties 
related to the jet energy scale measurements. 

• The tagging of b-flavoured jets can be done, but the efficiency depends strongly on the potential 
backgrounds. Systematic efficiency uncertainties for the b-tagging are difficult to quantify but 
it seems that, in the absence of a new method, relative b-tagging uncertainties below it 5% are 
almost impossible to achieve. 

With this baseline LHC detector capabilities, it seems useful to divide the various high q"^ LHC 
reactions into essentially five different non overlapping categories. Such a devision can be used to make 
some reasonable accurate estimates of the different systematics. 

• Drell-Yan type lepton pair final states. This includes on- and off-shell W and Z decays. 

• 7-jet and 77X final states. 

• Diboson events of the type WW, WZ, ZZ, Wj with leptonic decays of the W and Z bosons. 
One might consider to include the Standard Model Higgs signatures into this group of signatures. 

• Events with top quarks in the final state, identified with at least one isolated lepton. 

• Hadronic final states with up to n(=2,3 ..) Jets and different pt and mass. 

With this "grouping" of experimental final states, one can now start to analyze the different po- 
tential error sources. Where possible, one can try to define and use relative measurements of various 
reactions such that some systematic errors will simply cancel. 

Starting with the resonant W and Z production with leptonic decays, several million of clean 
events will be collected quickly, resulting in relative statistical errors well below ±1%. Theoretical 
calculations for these reactions are well advanced and these reactions are among the best understood 
LHC final states allowing to build the most accurate LHC Monte Carlo generators. Furthermore, some 
of the experimental uncertainties can be reduced considerably if ratio measurements of cross section, 
such as W^ /W~ and Z/W, are performed. The similarities in the production mechanism should also 
allow to reduce theoretical uncertainties for such ratios. The experimental counting accuracy of W and 
Z events, which includes background and efficiency corrections, might achieve eventually uncertainties 
of 1% or slightly better for cross section ratios. 

Furthermore, it seems that the shape of the pt distribution of the Z, using the decay into electron 
pairs {pp ZX e+e~ X), can be determined with relative accuracies of much less than 1%. This 
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distribution, shown in figure 1, can be used to tune the Monte Cai^lo description of this particular- process. 
This tuning of the Monte Carlo can than be used almost directly to predict theoretically also the W pt 
spectrum, and the pt spectrum for high mass Drell-Yan lepton pair events. Once an accurate model 
description of these Standard Model reactions is achieved one might use these insights also to predict the 
Pt spectrum of other well defined final states. 



pp ^ Z°X ^ e*e- X 



pp ^ Z°X ^ e*e- X 



LHC (14 TeV) L = Ifb^^ (PYTHIA LO) 

Z° p, spectrum (with |y(Z °)|<2) 



40 60 

p,(Z°) [GeV] 



LHC (1 4 TeV) L = 1 fb"' (PYTHIA LO) 

Z° p, spectrum (with |y(Z °)|<2) 




p,(Z°) [GeV] 



Fig. 1: Simple simulation of a potential measurement of the Z pt spectrum, possible with a luminosity of only 1 fb ^. Who 
will be able to predict this pt spectrum in all its beauty and with similar accuracy? 

From all the various high reactions, the inclusive production of W and Z events is known to be 
the theoretically best understood and best experimentally measurable LHC reaction. Consequently, the 
idea to use these simple well defined final states as the LHC cross section normalisation tool, or standard 
candle was described first in reference [6]. This study indicated that the W and Z production might result 
in a precise and simple parton luminosity monitor. In addition, these reactions can also be used to im- 
prove the relative knowledge of the PDF's. In fact, if one gives up on the idea to measure absolute cross 
sections, the relative parton luminosity can in principle be determined with relative uncertainties well be- 
low ±5%, the previously expected possible limit for any absolut proton-proton luminosity normalisation 
procdure. 

In summary, one can estimate that it should be possible to reduce experimental uncertainties for 
Drell-Yan processes to systematic uncertainties below ±5%, optimistically one might envisage an event 
counting accuracy of perhaps ±1%, limited mainly from the lepton identification efficiency. 

The next class of final states, which can be measured exclusively with leptons, are the diboson pair 
events with subsequent leptonic decays. Starting with the ZZ final state, we expect that the statistical 
accuracy will dominate the measurement for several years. Nevertheless, the systematic uncertainties of 
the measurement, based on four leptons, should in principle be possible with relative eiTors of a few % 
only. 

The production of WZ and WW involves unmeasurable neutrinos. Thus, experimentally only an 
indirect and incomplete determination of the kinematics of the final states is possible and very detailed 
simulations with precise Monte Carlo generators are required for the interpretation of these final state. 
It seems that a measurement of the event counting with an accuracy below ±5%, due to efficiency 
uncertainties from the selection alone, to be highly non trivial. Nevertheless, if the measurements and 
the interpretations can be done relative to the W and Z resonance production, some uncertainties from 
the lepton identification efficiency, from the PDF and from the theoretical calculation can perhaps be 
reduced. Without going into detailed studies for each channel, one could try to assume that a systematic 



10 



uncertainty of ±5% might be defined as a goal. Similar characteristics and thus limitations can be 
expected for other diboson signatures. 

The production cross section of top antitop quark pairs is large and several million of semilep- 
tonic tagged and relatively clean events (j)p tt ^ WbWb identified with one leptonic W decay) 
can be expected. However, the signature involves several jets, some perhaps tagged as b-flavoured, and 
missing transverse momentum from the neutrino(s). The correct association of the various jets to the 
corresponding top quark is known to be extremely difficult, leading to large combinatorial backgrounds. 
Thus, it seems that, even if precise Monte Carlo generators will become eventually available, that system- 
atic uncertainties smaller than 5-10% should not be expected. Consequently, we assume that top antitop 
backgrounds for a wide class of signals can not be determined with uncertainties smaller than 5-10%. 

Measurements of so called "single" top quarks are even more difficult, as the cross section is 
smaller and larger backgrounds exist. Systematic en^ors will therefore always be larger than the one 
guessed for top-antitop pair production. 

Finally, we can address the QCD jet production. Traditionally one measures and interprets the so 
called jet cross section as a function of pt jet and the mass of the multi jet system using various rapidity 
intervals. With the steeply falling pt jet specti"um and essentially no background, one will determine the 
differential spectrum such that only the slope has to be measured with good relative accuracy. If one is 
especially interested into the super high mass or high pt events, then we expect that migrations due to jet 
mis-measurements and non Gaussian tails in the jet energy measurements will limit any measurement. 
A good guess might be that the LHC experiments can expect absolut normalisation uncertainties similar 
to the ones achieved with CDF and DO, corresponding to uncertainties of about it 10-20%. 

Are the above estimated systematic limits for the various measurements pessimistic, optimistic or 
simply realistic? Of course, only real experiments will tell during the coming LHC years. However, while 
some of these estimates will need perhaps some small modification, they could be used as a limit waiting 
to be improved during the coming years. Thus, some people full of ideas might take these numbers 
as a challenge, and discover and develop new methods that will improve these estimates. This guess 
of systematic limitations for LHC experiments could thus be considered as a "provocation", which will 
stimulate activities to prove them wrong. In fact, if the experimental and theoretical communities could 
demonstrate why some of these "pessimistic" numbers are wrong the future real LHC measurements 
will obviously benefit from the required efforts to develope better Monte Carlo programs and better 
experimental methods. 

The following summary from a variety of experimental results from previous high energy collider 
experiments might help to quantify particular areas of concern for the LHC measurements. These pre- 
vious measurements can thus be used as a starting point for an LHC experimenter, who can study and 
explain why the corresponding errors at LHC will be smaller or lai^ger. 

2.2.2 Learning from previous collider experiments 

It is broadly accepted, due to the huge hadronic interaction rate and the short bunch crossing time, that 
the experimental conditions at the LHC will be similar or worse than the ones at the Tevatron collider. 
One experimental answer was to improve the granularity, speed and accuracy of the different detector 
elements accordingly. Still, no matter how well an experiment can be realized, the LHC conditions to 
do experiments will be much more difficult than at LEP or any hypothetical future high energy e+e~ 
collider. One important reason is the large theoretical uncertainty, which prevents to make signal and 
background Monte Carlos with accuracies similar to the ones which were used at LEP. 

Thus, we can safely expect that systematic enors at LHC experiments will be larger than the 
corresponding ones from LEP and that the Tevatron experience can be used as a first guess. 

• Measurement of cjxBR for W and Z production from CDF [7] and DO [8]: 

The CDF collaboration has presented a high statistics measurement with electrons and muons. 
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Similar systematic errors of about it 2% were achieved for efficiency and thus the event counting 
with electrons and muons. The error was reduced to it 1.4% for the ratio measurement where some 
lepton identification efficiencies cancel. Similar errors about x 1.5-2 larger have been obtained by 
the corresponding measurements from the DO experiment. 

• Measurement of the cross section f or pp — > ^7(7) from DO [9]: 

A total of 138 ee7 and 152 fifx^ candidate events were selected. The background was estimated 
to be about 10% with a systematic uncertainty of it 10-15%, mainly from 7-jet misidentifica- 
tion. Using Monte Carlo and a large sample of inclusive Z events, the efficiency uncertainty has 
been estimated to be w 5% and when the data were used in comparison with the Standard Model 
prediction another uncertainty of 3.3% originating from PDF's was added. 

• Measurement of the pp — > tt production cross section from CDF [10] 

A recent CDF measurement, using 197 pb~^, obtained a cross section (in pb) of 7.0 -1-2.4 (-2.1) 
from statistics. This should be comapred with -1-1.7 (-1.2) from systematics, which includes ±0.4 
from the luminosity measurement. Thus, uncertainties from efficiency and background are roughly 
±20%. It is expected that some of the uncertainties can be reduced with the expected 10 fold lumi- 
nosity increase such that the systematic error will eventually decrease to about ± 10%, sufficient 
to be better than the expected theoretical eiTor of ± 15%. 

• A search for Supersymmetry with b-tagged jets from CDF [11]: 

This study, using single and double b-tagged events was consistent with background only. How- 
ever, it was claimed that the background uncertainty was dominated by the systematic error, which 
probably originated mostly from the b tagging efficiency and the misidentification of b-flavoured 
jets. The numbers given were 16.4± 3.7 events (3.15 from systematics) for the single b-tagged 
events and 2.6±0.7 events (0.66 from systematics) for the double b-tagged events. These errors 
originate mainly from the b-tagging efficiency uncertainties, which are found to roughly ± 20-25% 
for this study of rare events. 

• Some "random" selection of recent e+e" measurements: 

A recent measurement from ALEPH (LEP) of the W branching ratio to qq estimated a systematic 
uncertainty of about ± 0.2% [12]. This small uncertainty was possible because many additional 
constraints could be used. 

OPAL has reported a measurement of Ri, at LEP II energies, with a systematic uncertainty of ± 
3.7%. Even though this uncertainty could in principle be reduced with higher statistics, one can 
use it as an indication on how large efficiency uncertainties from b-tagging are already with clean 
experimental conditions [13] 

Recently, ALEPH and DELPHI have presented cross section measurements for e+e" 77 with 
systematic errors between 2.2% (ALEPH) [14] and 1.1% (DELPHI) [15]. In both cases, the effi- 
ciency uncertainty, mainly from conversions, for this in principle easy signal was estimated to be 
roughly 1%. In the case of ALEPH an uncertainty of about ±0.8% was found for the background 
correction. 

Obviously, these measurements can only be used, in absence of anything better, as a most op- 
timistic guess for possible systematic limitations at a hadron collider. One might conclude that the 
systematics from LEP experiments give (1) an optimistic limit for comparable signatures at the LHC and 
(2) that the results from CDF and DO should indicate systematics which might be obtained realistically 
during the early LHC years. 

Thus, in summary the following list might be used as a first order guess on achievable LHC 
systematics^^ . 

• "Isolated" muons, electrons and photons can be measured with a small momentum (energy) un- 
certainty and with an almost perfect angular resolution. The efficiency for pt > 20 GeV and 

^^Reality will hopefully show new brilliant ideas, which combined with hard work will allow to obtain even smaller uncer- 
tainties. 
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\r]\ < 2.5 will be "high" and can be controlled optimistically to it 1%. Some straight forward 
selection criteria should reduce jet background to small or negligible levels. 

• "Isolated" jets with a pt > 30 GeV and \r]\ < 4.5 can be seen with high (veto) efficiency and 
a small uncertainty from the jet direction measurement. However, it will be very difficulty to 
measure the absolute jet energy scale and Non-Gaussian tails will limit the systematics if the jet 
energy scale is important. 

• Measurements of the missing transverse momentum depend on the final state but will in general 
be a sum of the errors from the lepton and the jet accuracies. 

Using these assumptions, the following "optimistic" experimental systematic errors can be used 
as a guideline: 

1 . Efficiency uncertainties for isolated leptons and photons with a pt above 20 GeV can be estimated 
with a lb 1 % accuracy. 

2. Efficiencies for tagging jets will be accurate to a few percent and the efficiency to tag b-flavoured 
jets will be known at best within ±5%. 

3. Backgrounds will be known, combining theoretical uncertainties and some experimental determi- 
nations, at best with a ±5-10% accuracy. Thus, discovery signatures without narrow peaks require 
signal to background ratios larger than 0.25-0.5, if 5 a discoveries are claimed. Obviously, for 
accurate cross section measurements, the signal to background ratio should be much larger. 

4. In case of ratio measurements with isolated leptons, like pp — > /pp — > W~, relative errors 
between 0.5-1% should be possible. Furthermore, it seems that the measurement of the shape of Z 
Pt spectrum, using Z— > e"^e^, will be possible with a systematic error much smaller than 1%. As 
the Z cross section is huge and clean we expect that this signature will become the best measurable 
final state and should allow to test a variety of production models with errors below it 1%, thus 
challenging future QCD calculations for a long time. 
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2.3 Uncertainties on W and Z production at tlie LHC 

2.3.1 Introduction 

At leading order (LO), W and Z production occur by the process, qq W/Z, and the momentum 
fractions of the partons participating in this subprocess are given by xi^2 = ^ exp(iby), where M is 
the centre of mass energy of the subprocess, M = My/ or Mz, yjs is the centre of mass energy of 
the reaction (y^s = 14 TeV at the LHC) and y = ^ In gives the parton rapidity. The kinematic 

plane for LHC parton kinematics is shown in Fig. 2. Thus, at central rapidity, the participating partons 
have small momentum fractions, x ^ 0.005. Moving away from central rapidity sends one paiton to 
lower X and one to higher x, but over the measurable rapidity range, \y\ < 2.5, x values remain in 
the range, 10~^ < a; < 0.1. Thus, in contrast to the situation at the Tevatron, valence quarks are not 
involved, the scattering is happening between sea quarks. Furthermore, the high scale of the process 
= ~ 10, 000 GeV^ ensures that the gluon is the dominant parton, see Fig. 2, so that these sea 
quarks have mostly been generated by the flavour blind g ^ qq splitting process. Thus the precision of 
our knowledge of W and Z cross-sections at the LHC is crucially dependent on the uncertainty on the 
momentum distribution of the gluon. 




Fig. 2: Left plot: The LHC kinematic plane (tlianks to James Stirling). Right plot: PDF distributions at = 10, 000 GeV^. 

HERA data have dramatically improved our knowledge of the gluon, as illustrated in Fig. 3, which 
shows W and Z rapidity spectra predicted from a global PDF fit which does not include the HERA data, 
compared to a fit including HERA data. The latter fit is the ZEUS-S global fit [16], whereas the former is 
a fit using the same fitting analysis but leaving out the ZEUS data. The full PDF uncertainties for both fits 
are calculated from the error PDF sets of the ZEUS-S analysis using LHAPDF [17] (see the contribution 
of M.Whalley to these proceedings). The predictions for the W/Z cross-sections, decaying to the lepton 
decay mode, are summarised in Table 1. The uncertainties in the predictions for these cross-sections 
have decreased from ~ 16% pre-HERA to ~ 3.5% post-HERA. The reason for this can be seen clearly 
in Fig. 4, where the sea and gluon distributions for the pre- and post-HERA fits are shown for several 
different bins, together with their uncertainty bands. It is the dramatically increased precision in the 

^'Contributing authors: Alessandro Tricoli, Amanda Cooper-Sarkar, Claire Gwenlan 
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PDF Set a{W+).B{W+ ^l+i^i) a{W~).B{W~ rvi) a{Z).B{Z ^l+ l 

ZEUS-SnoHERA 10.63 ± 1.73 nb 7.80 ± 1.18 nb 1.69 ± 0.23 nb 

ZEUS-S 12.07 ± 0.41 nb 8.76 ± 0.30 nb 1.89 ± 0.06 nb 

CTEQ6.1 11.66 ± 0.56 nb 8.58 ± 0.43 nb 1.92 ± 0.08 nb 

MRSTOl 11.72 ± 0.23 nb 8.72 ± 0.16 nb 1.96 ± 0.03 nb 



Table 1 : LHC W/Z cross-sections for decay via tlie lepton mode, for various PDFs 



HERA excluded 
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Fig. 3: LHC , W~ , Z rapidity distributions and their PDF uncertainties (the full line shows the central value and the dashed 
lines show the spread of the uncertainty): Top Row: from the ZEUS-S global PDF analysis not including HERA data; left plot 
middle plot W^; right plot Z: Bottom Row: from the ZEUS-S global PDF analysis including HERA data; left plot W+; 
middle plot W~; right plot Z 



low-rr gluon PDF, feeding into increased precision in the low-x sea quarks, which has led to the increased 
precision on the predictions for W/Z production at the LHC. 

Further evidence for the conclusion that the uncertainties on the gluon PDF at the input scale 
(Qq = 7 GeV^, for ZEUS-S) are the major contributors to the uncertainty on the W/Z cross-sections at 
= M\y{Mz), comes from decomposing the predictions down into their contributing eigenvectors. 
Fig 5 shows the dominant contributions to the total uncertainty from eigenvectors 3, 7, and 11 which are 
eigenvectors which ai"e dominated by the pai^ameters which control the low-x, mid-x and high-x, gluon 
respectively. 

The post-HERA level of precision illustrated in Fig. 3 is taken for granted in modem analyses, such 
that W/Z production have been suggested as 'standard-candle' processes for luminosity measurement. 
However, when considering the PDF uncertainties on the Standard Model (SM) predictions it is necessary 
not only to consider the uncertainties of a particular PDF analysis, but also to compare PDF analyses. 
Fig. 6 compares the predictions for W~^ production for the ZEUS-S PDFs with those of the CTEQ6. 1 [18] 
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Fig. 4: Sea (xS) and gluon (xg) PDFs at various Q^: left plot; from the ZEUS-S global PDF analysis not including HERA 
data; right plot: from the ZEUS-S global PDF analysis including HERA data. The inner cross-hatched error bands show 
the statistical and uncorrelated systematic uncertainty, the outer error bands show the total uncertainty including experimental 
correlated systematic uncertainties, normalisations and model uncertainty. 
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Fig. 6: LHC VK+ rapidity distributions and tlieir PDF uncertainties: left plot, ZEUS-S PDFs; middle plot, CTEQ6.1 PDFs; 
riglit plot: MRSTOl PDFs. 

PDFs and the MRSTOl [19] PDFs^*^. The corresponding cross-sections, for decay to leptonic mode 
are given in Table 1. Comparing the uncertainty at central rapidity, rather than the total cross-section, we 
see that the uncertainty estimates are rather larger: 5.2% for ZEUS-S; 8.7% for CTEQ6.1M and about 
3.6% for MRSTOl. The difference in the central value between ZEUS-S and CTEQ6.1 is 3.5%. Thus 
the spread in the predictions of the different PDF sets is comparable to the uncertainty estimated by the 
individual analyses. Taking all of these analyses together the uncertainty at central rapidity is about 8%. 

Since the PDF uncertainty feeding into the W^, and Z production is mostly coming from 
the gluon PDF, for all three processes, there is a strong correlation in their uncertainties, which can be 
removed by taking ratios. Fig. 7 shows the W asymmetry 

Aw = {W+ - W-)/{W+ + W-). 

for CTEQ6.1 PDFs, which have the largest uncertainties of published PDF sets. The PDF uncertainties on 
the asymmetry are very small in the measurable rapidity range. An eigenvector decomposition indicates 
that sensitivity to high-x u and d quark flavour distributions is now evident at large y. Even this residual 
flavour sensitivity can be removed by taking the ratio 

Azw = Z/{W-^ + W-) 

as also shown in Fig. 7. This quantity is almost independent of PDF uncertainties. These quantities have 
been suggested as benchmarks for our understanding of Standard Model Physics at the LHC. However, 
whereas the Z rapidity distribution can be fully reconstructed from its decay leptons, this is not possible 
for the W rapidity distribution, because the leptonic decay channels which we use to identify the W's 
have missing neutrinos. Thus we actually measure the W's decay lepton rapidity spectra rather than the 
W rapidity spectra. The lower half of Fig. 7 shows the rapidity spectra for positive and negative leptons 
from and decay and the lepton asymmetry, 

A = (/+-r )/(/+ + /-). 

A cut of, pti > 25 GeV, has been applied on the decay lepton, since it will not be possible to trigger 
on leptons with small pu. A particular lepton rapidity can be fed from a range of W rapidities so that 
the contributions of partons at different x values is smeared out in the lepton spectra, but the broad 
features of the W spectra and the sensitivity to the gluon parameters remain. The lepton asymmetry 
shows the change of sign at large y which is characteristic of the V — A structure of the lepton decay. 
The cancellation of the uncertainties due to the gluon PDF is not so perfect in the lepton asymmetry as 
in the W asymmetry. Nevertheless in the measurable rapidity range sensitivity to PDF parameters is 
small. Correspondingly, the PDF uncertainties are also small ( 4%) and this quantity provides a suitable 
Standard Model benchmark. 

^"'MRSTOl PDFs are used because the full error analysis is available only for this PDF set. 
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Fig. 7: Predictions for W, Z production at the LHC from tlie CTEQ6. 1 PDFs. Top row: left plot, the W asymmetry. Aw', right 
plot, the ratio, Azw'- Bottom row: left plot, decay e+ rapidity spectrum; middle plot, decay e" rapidity spectrum; right plot, 
lepton asymmetry, Ae 

In summary, these preliminary investigations indicate that PDF uncertainties on predictions for the 
W, Z rapidity spectra, using standard PDF sets which describe all modern data, have reached a precision 
of ^ 8%. This may be good enough to consider using these processes as luminosity monitors. The 
predicted precision on ratios such as the lepton ratio, Ai, is better {r^ 4%) and this measurement may be 
used as a SM benchmark. It is likely that this current level of uncertainty will have improved before the 
LHC turns on- see the contribution of C. Gwenlan (section 3.2) to these proceedings. The remainder of 
this contribution will be concerned with the question: how accurately can we measure these quantities 
and can we use the early LHC data to improve on the current level of uncertainty? 

2. 3. 2 k-f actor and PDF re-weighting 

To investigate how well we can really measure W production we need to generate samples of Monte- 
Carlo (MC) data and pass them through a simulation of a detector. Various technical problems arise. 
Firstly, many physics studies are done with HERWIG (6.505) [20], which generates events at LO with 
parton showers to account for higher order effects. Distributions can be corrected from LO to NLO by 
k-factors which are applied as a function of the variable of interest. The use of HERWIG is gradually 
being superceded by MC@NLO (2.3) [21] but this is not yet implemented for all physics processes. Thus 
it is necessary to investigate how much bias is introduced by using HERWIG with k-factors. Secondly, to 
simulate the spread of current PDF uncertainties, it is necessary to run the MC with all of the eigenvector 
error sets of the PDF of interest. This would be unreasonably time-consuming. Thus the technique of 
PDF reweighting has been investigated. 

One million W ev^ events were generated using HERWIG (6.505). This corresponds to 43 
hours of LHC running at low luminosity, 10/6"^. The events are split into and W~ events according 
to their Standard Model cross-section rates, 58%: 42% (the exact spUt depends on the input PDFs). These 
events are then weighted with k-factors, which are analytically calculated as the ratio of the NLO to LO 
cross-section as a function of rapidity for the same input PDF [22]. The resultant rapidity spectra for 
W~^,W^ are, compared to rapidity spectra for ~ 107,700 events generated using MC@NLO(2.3) in 
Fig 8^^ The MRST02 PDFs were used for this investigation. The accuracy of this study is limited by the 

^'in MC@NLO the hard emissions are treated by NLO computations, whereas soft/coUinear emissions are handled by the 
MC simulation. In the matching procedure a fraction of events with negative weights is generated to avoid double counting. 
The event weights must be applied to the generated number of events before the effective number of events can be converted to 
an equivalent luminosity. The figure given is the effective number of events. 
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Fig. 8: Top Row: W rapidity and pt spectra for events generated witli HERWIG + k-Factors (full line), compared to those 
generated by MC@NLO (dashed line); left plot rapidity; middle plot rapidity; right plot pt- Bottom row: 
the fractional differences of the spectra generated by HERWIG + k-factors and those generated by MC@NLO. The full line 
represents the weighted mean of these difference spectra and the dashed lines show its uncertainty 



Statistics of the MC@NLO generation. Nevertheless it is clear that HERWIG with k-factors does a good 
job of mimicking the NLO rapidity spectra. However, the normalisation is too high by 3.5%. This is 
not suprising since, unlike the analytic code, HERWIG is not a purely LO calculation, parton showering 
is also included. This normalisation difference is not too crucial since in an analysis on real data the 
MC will only be used to correct data from the detector level to the generator level. For this purpose, 
it is essential to model the shape of spectra to understand the effect of experimental cuts and smearing 
but not essential to model the overall normalisation perfectly. However, one should note that HERWIG 
with k-factors is not so successful in modelling the shape of the pt spectra, as shown in the right hand 
plot of Fig. 8. This is hardly surprising, since at LO the W have no pt and non-zero pt for HERWIG 
is generated by parton showering, whereas for MC@NLO non-zero pt originates from additional higher 
order processes which cannot be scaled from LO, where they are not present. 

Suppose we generate W events with a particular PDF set: PDF set 1. Any one event has the 
hard scale, = M^, and two primary partons of flavours flavi and flav2, with momentum fractions 
xi,X2 according to the distributions of PDF set 1. These momentum fractions are applicable to the hard 
process before the parton showers are implemented in backward evolution in the MC. One can then 
evaluate the probability of picking up the same flavoured partons with the same momentum fractions 
from an alternative PDF set, PDF set 2, at the same hai^d scale. Then the event weight is given by 

PT^TP. . fpDFa (xi, fiavi, Q^).fpDF2(x2, flav2, Q^) 

PDF re - weight = (1) 

iPDFi (xi , fiavi , Q^) .tpDFi (x2 , flav2 , ) 

where xfpj:)p{x,flav,Q'^) is the parton momentum distribution for flavour, flav, at scale, Q^, and 
momentum fraction, x. Fig. 9 compares the and W~ spectra for a million events generated using 
MRST02 as PDF set 1 and re- weighting to CTEQ6.1 as PDF set 2, with a million events which are di- 
rectly generated with CTEQ6. 1. Beneath the spectra the fractional difference between these distributions 
is shown. These difference spectra show that the reweighting is good to better than 1%, and there is no 
evidence of a y dependent bias. This has been checked for reweighting between MRST02, CTEQ6.1 and 
ZEUS-S PDFs. Since the uncertainties of any one analysis are similar in size to the differences between 
the analyses it is clear that the technique can be used to produce spectra for the eigenvector error PDF 
sets of each analysis and thus to simulate the full PDF uncertainties from a single set of MC generated 
events. Fig. 9 also shows a similar comparison for pt spectra. 



2.3.3 Background Studies 

To investigate the accuracy with which W events can be measured at the LHC it is necessary to make 
an estimate of the importance of background processes. We focus on W events which are identified 
through their decay to the W ^ e Ve channel. There are several processes which can be misidentified 
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Fig. 9: Left side: W~ (left) and W (right) rapidity spectra, for events generated with MRST02 PDFs reweighted to CTEQ6. 1 
PDFs (full line), compared to events generated directly with CTEQ6.1 PDFs (dashed line). The fractional difference between 
these spectra are also shown beneath the plots. The full line represents the weighted mean of these difference spectra and the 
dashed lines show its uncertainty. IMfe^^ltecS?8?f (SP5iihffgfiH%ackground due to cuts 
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as — > efe- These are: W rur, with r decaying to the electron channel; Z t^t^ with at least 
one T decaying to the electron channel (including the case when both r's decay to the electron channel, 
but one electron is not identified); Z — > e"'"e~ with one electron not identified. We have generated one 
million events for each of these background processes, using HERWIG and CTEQ5L, and compared 
them to one million signal events generated with CTEQ6.1. We apply event selection criteria designed 
to eliminate the background preferentially. These criteria are: 

• ATLFAST cuts (see Sec. 2.3.5) 

• pseudorapidity, |r/| < 2.4, to avoid bias at the edge of the measurable rapidity range 

• Pte > 25 GeV, high pt is necessary for electron triggering 

• missing Et > 25 GeV, the in a signal event will have a correspondingly large missing Et 

• no reconstructed jets in the event with pt > 30 GeV, to discriminate against QCD background 

• recoil on the transverse plane < 20 GeV, to discriminate against QCD background 

Table 2 gives the percentage of background with respect to signal, calculated using the known relative 
cross-sections of these processes, as each of these cuts is applied. After, the cuts have been applied the 
background from these processes is negligible. However, there ai^e limitations on this study from the fact 
that in real data there will be further QCD backgrounds from 2^2 processes involving q,q,g in which 
a final state vr*^ — > 77 decay mimics a single electron. A preliminary study applying the selection criteria 
to MC generated QCD events suggests that this background is negligible, but the exact level of QCD 
background cannot be accurately estimated without passing a very large number of events though a full 
detector simulation, which is beyond the scope of the current contribution. 
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Fig. 10: The rates of charge misidentification as a function of rapidity for e misidentified as e (left), e misidentifed as e 
(right). 



2.3.4 Charge misidentification 

Clearly charge misidentification could distort the lepton rapidity spectra and dilute the asymmetry Ai. 

^^^'^ ~ l_F- -F+ 

where A^aw is the measured asymmetry, ^t^we is the true asymmetry, F~ is the rate of true e~ misiden- 
tified as e+ and is the rate of true e"*" misidentified as e~ . To make an estimate of the importance of 
charge misidentification we use a sample of Z ^ e^e~ events generated by HERWIG with CTEQ5L 
and passed through a full simulation of the ATLAS detector. Events with two or more charged electro- 
magnetic objects in the EM calorimeter are then selected and subject to the cuts; \7]\ < 2.5, pte > 25 
GeV, as usual and, E/p < 2, for bremsstrahlung rejection. We then look for the charged electromagnetic 
pair with invariant mass closest to Mz and impose the cut, 60 < Mz < 120 GeV. Then we tag the 
charge of the better reconstructed lepton of the pair and check to see if the charge of the second lepton is 
the same as the first. Assuming that the pair really came from the decay of the Z this gives us a measure 
of charge misidentification. Fig 10 show the misidentification rates F^, F~ as functions of pseudorapid- 
ity^^. These rates are very small. The quantity Ai, can be corrected for charge misidentification applying 
Barlow's method for combining asymmetric eiTors [23]. The level of correction is 0.3% in the central 
region and 0.5% in the more forward regions. 



2.3.5 Compare events at the generator level to events at the detector level 

We have simulated one million signal, W — > eVe, events for each of the PDF sets CTEQ6. 1, MRST2001 
and ZEUS-S using HERWIG (6.505). For each of these PDF sets the eigenvector eiTor PDF sets have 
been simulated by PDF reweighting and k-factors have been applied to approximate an NLO generation. 
The top part of Fig. 1 1 shows the and Ai spectra at this generator level, for all of the PDF sets 
superimposed. The events are then passed through the ATLFAST fast simulation of the ATLAS detector. 
This applies loose kinematic cuts: |r/| < 2.5, pte > 5 GeV, and electron isolation criteria. It also smears 
the 4-momenta of the leptons to mimic momentum dependent detector resolution. We then apply the 
selection cuts described in Sec. 2.3.3. The lower half of Fig. 11 shows the and Ai spectra at the 
detector level after application of these cuts, for all of the PDF sets superimposed. The level of precision 
of each PDF set, seen in the analytic calculations of Fig. 6, is only slightly degraded at detector level, so 
that a net level of PDF uncertainty at central rapidity of ~ 8% is maintained. The anticipated cancellation 
of PDF uncertainties in the asymmetry spectrum is also observed, within each PDF set, and the spread 
between PDF sets suggests that measurements which are accurate to better than ~ 5% could discriminate 
between PDF sets. 

^^These have been corrected for the small possibility that the better reconstructed lepton has had its charge misidentified as 
follows. In the central region, < 1, assume the same probability of misidentification of the first and second leptons, in the 
more forward regions assume the same rate of first lepton misidentification as in the central region. 
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Fig. 11: Top row: e" , and A^. rapidity spectra for the lepton from the W decay, generated using HERWIG + k factors and 
CTEQ6. 1 (red), ZEUS-S (green) and MRST2001 (black) PDF sets with full uncertainties. Bottom row: the same spectra after 
passing through the ATLFAST detector simulation and selection cuts. 

2.3.6 Using LHC data to improve precision on PDFs 

The high cross-sections for W production at the LHC ensure that it will be the experimental systematic 
errors, rather than the statistical errors, which are determining. We have imposed a random 4% scat- 
ter on our samples of one million W events, generated using different PDFs, in order to investigate if 
measurements at this level of precision will improve PDF uncertainties at central rapidity significantly 
if they are input to a global PDF fit. Fig. 12 shows the e+ and e~ rapidity spectra for events generated 
from the ZEUS-S PDFs {\7]\ < 2.4) compared to the analytic predictions for these same ZEUS-S PDFs. 
The lower half of this figure illustrates the result if these events are then included in the ZEUS-S PDF 
fit. The size of the PDF uncertainties, at y = 0, decreases from 5.8% to 4.5%. The largest improvement 
is in the PDF parameter controlling the low-x gluon at the input scale, Q^: xg{x) ~ x-^b at low-x, 
\g = -0.199 ± 0.046, before the input of the LHC pseudo-data, compared to, = -0.196 ± 0.029, 
after input. Note that whereas the relative normalisations of the e"*" and e~ spectra are set by the PDFs, 
the absolute normalisation of the data is free in the fit so that no assumptions are made on our ability to 
measure luminosity. Secondly, we repeat this procedure for events generated using the CTEQ6.1 PDFs. 
As shown in Fig. 13, the cross-section for these events is on the lower edge of the uncertainty band of 
the ZEUS-S predictions. If these events are input to the fit the central value shifts and the uncertainty de- 
creases. The value of the parameter Xg becomes, Xg = —0.189 ± 0.029, after input of these pseudo-data. 
Finally to simulate the situation which really faces experimentalists we generate events with CTEQ6.1, 
and pass them through the ATLFAST detector simulation and cuts. We then correct back from detector 
level to generator level using a different PDF set- in this case the ZEUS-S PDFs- since in practice we will 
not know the true PDFs. Fig. 14 shows that the resulting corrected data look pleasingly like CTEQ6.1, 
but they are more smeared. When these data are input to the PDF fit the central values shift and errors 
decrease just as for the perfect CTEQ6. 1 pseudo-data. The value of Ac, becomes, A = —0.181 ± 0.030, 
after input of these pseudo-data. Thus we see that the bias introduced by the correction procedure from 
detector to generator level is small compared to the PDF uncertainty. 

2.3. 7 Conclusions and a warning: problems with the theoretical predictions at small-x? 

We have investigated the PDF uncertainty on the predictions for W and Z production at the LHC, taking 
into account realistic expectations for measurement accuracy and the cuts on data which will be needed 
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Fig. 12: Top row: e and e rapidity spectra generated from ZEUS-S PDFs compared to thie analytic prediction using ZEUS- 
S PDFs. Bottom row: the same lepton rapidity spectra compared to the analytic prediction AFTER including these lepton 
pseudo-data in the ZEUS-S PDF fit. 




Fig. 13: Top row: and e rapidity spectra generated from CTEQ6. 1 PDFs compared to the analytic prediction using ZEUS- 
S PDFs. Bottom row: the same lepton rapidity spectra compared to the analytic prediction AFTER including these lepton 
pseudo-data in the ZEUS-S PDF fit. 



23 



Fig. 14: Top row: e and rapidity spectra generated from CTEQ6. 1 PDFs, which have been passed through the ATLFAST 
detector simulation and corrected back to generator level using ZEUS-S PDFs, compared to the analytic prediction using 
ZEUS-S PDFs. Bottom row: the same lepton rapidity spectra compared to the analytic prediction AFTER including these 
lepton pseudo-data in the ZEUS-S PDF fit. 

to identify signal events from background processes. We conclude that at the present level of PDF 
uncertainty the decay lepton asymmetry, Ai , will be a useful standard model benchmark measurement, 
and that the decay lepton spectra can be used as a luminosity monitor which will be good to ~ 8%. 
However, we have also investigated the measurement accuracy necessary for early measurements of 
these decay lepton spectra to be useful in further constraining the PDFs. A systematic measurement 
error of less than ~ 4% would provide useful extra constraints. 

However, a caveat is that the current study has been performed using standard PDF sets which 
are extracted using NLO QCD in the DGLAP [24-27] formalism. The extension to NNLO is straight- 
forwai^d, giving small corrections ~ 1%. PDF analyses at NNLO including full accounting of the PDF 
uncertainties are not extensively available yet, so this small correction is not pursued here. However, there 
may be much larger uncertainties in the theoretical calculations because the kinematic region involves 
low-x. There may be a need to account for ln{l/x) resummation (first considered in the BFKL [28-30] 
formalism) or high gluon density effects. See reference [31] for a review. 

The MRST group recently produced a PDF set, MRST03, which does not include any data for 
X < 5 X 10~^. The motivation behind this was as follows. In a global DGLAP fit to many data sets there 
is always a certain amount of tension between data sets. This may derive from the use of an inappropriate 
theoretical formalism for the kinematic range of some of the data. Investigating the effect of kinematic 
cuts on the data, MRST found that a cut, x > 5 x 10"^, considerably reduced tension between the 
remaining data sets. An explanation may be the inappropriate use of the DGLAP formalism at small-x. 
The MRST03 PDF set is thus free of this bias BUT it is also only valid to use it for x > 5 x 10"^. 
What is needed is an alternative theoretical formalism for smaller x. However, the MRST03 PDF set 
may be used as a toy PDF set, to illustrate the effect of using very different PDF sets on our predictions. 
A comparison of Fig. 15 with Fig. 3 or Fig. 6 shows how different the analytic predictions are from the 
conventional ones, and thus illustrates where we might expect to see differences due to the need for an 
alternative formalism at small-x. 



24 




Fig. 15: LHC W+, VK", Z rapidity distributions for the MRST03 PDFs: left plot 1^+; middle plot VF" ; right plot Z 



2.4 W and Z production at the LHC^^ 

The study of the production at the LHC of the electroweak bosons W and Z with subsequent decays 
in leptonic final states will provide several precision measurements of Standard Model parameters such 
as the mass of the W boson or the weak mixing angle from the Z boson forward-backward asymmetry. 
Given their large cross section and clean experimental signatures, the bosons will furthermore serve 
as calibration tool and luminosity monitor. More challenging, differential cross sections in rapidity or 
transverse momentum may be used to further constrain parton distribution functions. Eventually these 
measurements for single inclusive boson production may be applied to boson pair production in order to 
derive precision predictions for background estimates to discovery channels like H — > W^W~. 

This contribution is devoted to the estimation of cun^ent uncertainties in the calculations for Stan- 
dard Model cross sections involving W and Z bosons with particular emphasis on the PDF and per- 
turbative uncertainties. All results are obtained at NLO with MCFM [32] version 4.0 interfaced to 
LHAPDF [17] for a convenient selection of various PDF families and evaluation of their intrinsic uncer- 
tainties. The cross sections are evaluated within a typical experimental acceptance and for momentum 
cuts summarised in Table 3. The electromagnetic decays of W and Z are considered (massless leptons) 
and the missing transverse energy is assigned to the neutrino momentum sum (in case of W decays). 
Jets in the processes W/Z + jets are produced in an inclusive mode with at least one jet in the event 

Table 3: Experimental acceptance cuts used for the calculation of cross-sections. 



Observable 




cut 


lept 

Pt 




> 25 GeV 


jet 

Pt 




> 25 GeV 


heptl 




< 3.0 






< 4.0 


R{lepton — 


jet) 


> 0.8 


R{lepton — 


lepton) 


> 0.2 


Tpmiss 

Ulrp 




>25 GeV 



reconstructed with the fc^^-algorithm. MCFM includes one- and two-jet processes at NLO and thi^ee-jet 
processes at LO. In the case of boson pair production the cuts of Table 3 can only be applied to the two 
leading leptons, hence a complete acceptance is assumed for additional leptons e.g. from ZZ or WZ 
decays. 

The calculations with MCFM are carried out for a given fixed set of electroweak input parame- 
ters using the effective field theory approach [32]. The PDF family CTEQ61 provided by the CTEQ 
collaboration [33] is taken as nominal PDF input while MRST2001E given by the MRST group [34] is 

Contributing author: Hasko Stenzel 
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considered for systematic purposes. The difference between CTEQ61 and MRST2001E alone can't be 
considered as systematic uncertainty but merely as cross-check. The systematic uncertainty is therefore 
estimated for each family separately with the family members, 40 for CTEQ61 and 30 for MRST2001E, 
which are variants of the nominal PDF obtained with different assumptions while maintaining a reason- 
able fit of the input data. The value of as is not a free input parameter for the cross section calculation 
but taken from the coiTcsponding value in the PDF. 

Important input parameters are renormaUsation and factorisation scales. The central results are 
obtained with jiR = jip = My, V = W,Z for single boson production and ij>r = fip = My + My 
for pair production (V' being the second boson in the event). Missing higher orders are estimated by 
a variation of the scales in the range 1/2 < x^r < 2 and independently 1/2 < Xfj^p < 2 where 
fi = Xfj_ ■ My, following prescriptions applied to other processes [35], keeping in mind that the range of 
variation of the scales is purely conventional. 

2.4.1 Single W and Z cross sections 

Detailed studies of single W and Z production including detector simulation are presented elsewhere in 
these proceedings, here these channels are mainly studied for comparison with the associated production 
with explicitly reconstructed jets and with pair production. The selected process is inclusive in the sense 
that additional jets, present in the NLO calculation, are not explicitly reconstructed. The experimentally 
required lepton isolation entailing a jet veto in a restricted region of phase space is disregai^ded at this 
stage. 

As an example the pseudo-rapidity distribution of the lepton from W decays and the px spectra for 
Z and are shown in fig. 16. The cross section for is larger than for W~ as a direct consequence 
of the difference between up- and down-quark PDFs, and this difference survives in the pseudo-rapidity 
distribution of the decay lepton with a maximum around |?7|=2.5. In the central part the PDF uncertainty, 
represented by the bands in fig. 16, amounts to about 5% for CTEQ and 2% for MRST, and within the 
uncertainty CTEQ and MRST are fully consistent. Larger differences ai^e visible in the peaks for the 
, where at the same time the PDF uncertainty increases. In the ratio /W~ the PDF uncertainty 
is reduced to about 1-2% in the central region and a difference of about 3% is observed between CTEQ 
and MRST, as can be seen from the double-ratio CTEQ/MRST. The uncertainty of the double ratio is 
calculated from the CTEQ uncertainty band alone. 

In the case of Z production the rapidity and px spectra can be fully reconstructed from the e+e" 
pair. A measurement of the Z px spectrum may be used to tune the Monte Carlo description of W 
Pt, which is relevant for measurements of the W mass. The px spectra are shown in the right part of 
fig. 16. The total yield for is about six times larger than for Z^ but for pT > 150 GeV the ratio 
stabilises around 4.5. At small values of pr the fixed-order calculation becomes trustless and should be 
supplemented by resummed calculations. The PDF uncertainties for the pT spectra themselves are again 
about 5% and about 2% in the ratio, CTEQ and MRST being consistent over the entire pT range. 

The perturbative uncertainties are estimated by variations of the renormaUsation and factorisation 
scales in by a factor of two. The scale variation entails a global change in the total cross section of 
the order of 5%. The r] distribution of leptons from W/Z decays are shown in fig. 17, comparing the 
nominal cross section with x^r = x^p = 1, to alternative scale settings. The nominal cross section 
is drawn with its PDF uncertainty band, illustrating that the perturbative uncertainties ai^e of the same 
size. For W~ and the shape of the distribution is essentially unaltered, but for the region around 
the maxima is changed more than the central part, leading to a shape deformation. The scale variation 
uncertainty is strongly correlated for W~ and Z^ and cancels in the ratio W~ /Z^, but for it is 
almost anti-coiTelated with and Z^ and partly enhanced in the ratio. 

Globally the perturbative uncertainty is dominated by the asymmetric scale setting x^r = 2, x^r = 
1/2 for which a change of —5% is observed, the largest upward shift of 3.5% is obtained for x^/j = 
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Fig. 16: Left: pseudo-rapidity distribution of the decay lepton from inclusive W production and right: pr spectra of W and 
Z. The bands represent the PDF-uncertainty. The lower inserts show on the left side the ratio /W~ resp. the double-ratio 
CTEQ/MRST and on the right side the ratios for W+fZ". 




Fig. 17: Left: pseudo-rapidity distribution of the decay lepton from inclusive W/Z production for different values of x^r and 
x^F = 1, centre: the ratio of predictions with respect to = 1 and right: double ratio V/V' of cross sections for actual scale 
settings normalised to the nominal scale. 
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2, Xfj_R = 2, locally the uncertainty for can be much different. It can be expected that the perturba- 
tive uncertainties ai^e reduced for NNLO calculations to the level of 1%. 

The integrated cross sections and systematic uncertainties within the experimental acceptance are 
summarised in Table 4. 

Table 4: Total cross-sections and systematic uncertainties within the experimental acceptance. 





w+ w- 


CTEQ61 [pb] 
A™ [pb] 

A™ [%] 


5438 4002 923.9 
±282 ±221 ±49.1 
±5.2 ±5.5 ±5.3 


MRST [pb] 
[pb] 

A^^l^ [%] 


5480 4110 951.1 
±103 ±83.4 ±17.4 
±1.9 ±2.1 ±1.9 


Apert [%] 


±3.5 ±3.5 ±3.1 
-5.2 -5.4 -5.5 



2.4.2 W/Z ± jet production 

In the inclusive production of W/Z ± jet at least one jet is requested to be reconstructed, isolated from 
any lepton by i? > 0.8. Additional jets are in case of overlap eventually merged at reconstruction level 
by the fcr-prescription. Given the presence of a relatively hard (jjt > 25 GeV) jet, it can be expected 
that PDF- and perturbative uncertainties aie different than for single boson production. The study of this 
process at the LHC, other than being a stringent test of perturbative QCD, may in addition contribute to 
a better understanding of the gluon PDF. 

The first difference with respect to single boson production appears in the lepton pseudo-rapidities, 
shown in fig. 18. The peaks in the lepton spectrum from disappeared, the corresponding spectrum 
from W~ is stronger peaked at central rapidity while the ratio /W~ with jets is essentially the same 
as without jets. The PDF uncertainties are slightly smaller (4.2-4.4%) compared to single bosons. The 
jet pseudo-rapidities are shown in the right part of fig. 18, they are much stronger peaked in the central 
region but the ratio /W~ for jets is similar to the lepton ratio. 

The transverse momenta of associated jets from W/Z ± jet production is shown in fig. 19, the 
spectra ai^e steeply falling and the ratio W^ /W~ is increasing from 1.3 at low pT to almost 2 at 500 
GeV PT. 

The perturbative uncertainties are investigated in the same way as for the single boson production 
and are shown in fig. 20. The scale variation entails here a much larger uncertainty between 8 and 10%, 
almost twice as lai^ge as for single bosons. In contrast to the latter case, the scale variation is con^elated 
for W and Z and cancels in the ratio W^ /W~, with an exception for W~ where a bump appears at 
= 1.8 for = 2. 

The total cross sections and their systematic uncertainties are summarised in Table 5. 
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Fig. 18; Left: pseudo-rapidity distribution of tire decay lepton from inclusive VK+jet production and right: pseudo-rapidity of 
the associated leading jet. The bands represent the PDF-uncertainty. 



Table 5: Total cross-sections and systematic uncertainties within the experimental acceptance for W/Z + jet processes. 





W+ + jet 


W- ± jet 


+ jet 


CTEQ61 [pb] 


1041 


784.5 


208.1 


[pb] 


±44.1 


±34.3 


±9.01 


A™ [%] 


±4.2 


±4.4 


±4.3 


MRST [pb] 


1046 


797.7 


211.3 


A|/^|^ [pb] 


±17.6 


±14.8 


±3.67 


Af «r [%] 


±1.7 


±1.9 


±1.8 


Apert [%] 


±8.7 


±8.9 


±7.6 




-9.8 


-10.0 


-9.1 



29 



LHC pp^W/Z+jet 



e 

06 

D 



10' 



10 



1.9 
1.8 

, 1.7 

^1.6 

^1.5 
o 

^1.3 
1.2 
1.1 



'A 



CTEQ61 ± uncertainty 



MRST2001 ± uncertainty 



NLO (MCFM) 



w 



1.1 



0.9 



o cc - 

« a- 



200 400 



50 100 150 200 250 300 350 400 450 500 



Fig. 19: Transverse momentum distribution of tire jet from inclusive W/Z + jet production 




Fig. 20: Left: pseudo-rapidity distribution of the decay lepton from inclusive W/Z + jet production for different values of 
Xij,R and XfiF ~ 1, centre: the ratio of predictions with respect to a;^ = 1 and right: double ratio V/V' of cross sections for 
actual scale settings normalised to the nominal scale. 
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'Ilcpton P^ltplon 

Fig. 21 : Left: pseudo-rapidity distribution of the decay lepton from inclusive WW production and right: transverse momentum 
of the decay lepton. 

2.4.3 Vector Boson pair production 

In the Standard Model the non-resonant production of vector bosons pairs in the continuum is suppressed 
by factors of lO'^-lO^ with respect to single Boson production. The cross sections for WW, WZ and ZZ 
within the experimental acceptance range from 500 fb (VFM^) to 10 fb (ZZ). Given the expected limited 
statistics for these processes, the main goal of their experimental study is to obtain the best estimate of 
the background they represent for searches of the Higgs boson or new physics yielding boson pairs. 

The selection of boson pairs follows in extension the single boson selection cuts applied to 2, 3 
or 4 isolated leptons. Again real gluon radiation and virtual loops have been taken into account at NLO 
but without applying lepton-jet isolation cuts. Lepton-lepton separation is considered only for the two 
leading leptons. 

The pseudo-rapidity and transverse momentum distributions taking the from W^W~ produc- 
tion as example ai^e shown in fig.21. The pseudo-rapidity is strongly peaked and the cross section at 
?7 = twice as large as at |r/| = 3. The PDF uncertainties are smaller than for single bosons, between 
3.5 and 4 %. 

The same shape of lepton distributions is also found for the other lepton and for the other pair 
production processes, as shown for the W~ Z^ case in fig.22. 

The rapidity distribution of the leading Z^ from ZZ production is shown in the left part of fig.23. 
With both Z's being fully reconstructed, the invariant mass of the ZZ system can be compared in the 
right part of fig.23 to the invariant mass spectrum of the Higgs decaying into the same final state for an 
intermediate mass of mu = 200 GeV. In this case a clear peak appears at low invariant masses above 
the continuum, and the mass spectrum is also harder at high masses in presence of the Higgs. 
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Fig. 23: Left: rapidity distribution of tlie leading Z from inclusive ZZ production and right: invariant mass of the ZZ pair for 
non-resonant continuum production compared to resonant pair production via the SM Higgs decay. 
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Ilepton 'l^P'"" 

Fig. 24: Left: pseudo-rapidity distributions of leptons from various boson pair production processes and different scale settings 
and right: ratio of predictions relative to — 1. 

The perturbative uncertainties, obtained as for the other processes, are shown in fig.24 for the 
lepton distributions. The systematic uncertainties range from 3.3 to 4.9 % and are slightly smaller than 
for single bosons, given the larger scale fi = 2My and better applicability of perturbative QCD. The 
perturbative uncertainty is essentially constant across the pseudo-rapidity and largely correlated between 
different pair production processes. 

The ratio of boson pair production to single Z production is of particular interest, as similar quark 
configurations contribute to both process types, though evidently in a somewhat different x, regime. 
This ratio is shown in fig. 25 for the lepton distribution, given the different shapes of pseudo-rapidity is 
not flat but its PDF uncertainty is reduced to the level of 2 %. The perturbative uncertainties of the VV/Z 
ratio, however, are only reduced for the ZZ/Z case and even slightly larger for other ratios because the 
scale variations have partly an opposite effect on the cross sections for Z and e.g. WW production. 

The total cross sections and their systematic uncertainties are summarised in Table 6. 
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Fig. 25: Left: the ratio of pseudo-rapidity distributions of leptons from boson pair production processes normalised to single Z 
production and right: the double ratio W jZ of predictions for different scales relative to = 1. 



Table 6: Total cross-sections and systematic uncertainties within the experimental acceptance for pair production processes. 





WW ZZ W+Z^ W'Z^ 


CTEQ61 [fb] 
A™ [fb] 
A™ [%] 


475.7 11.75 31.81 20.77 
±17.0 ±0.48 ±1.12 ±0.80 
±3.6 ±4.1 ±3.5 ±3.8 


MRST [fb] 
Af «r [fb] 

A|/«r [%] 


494.2 12.34 32.55 21.62 
±6.3 ±0.19 ±0.49 ±0.41 
±1.3 ±1.6 ±1.5 ±1.9 


Apert [%] 


±4.6 ±3.3 ±4.6 ±4.8 
-4.9 -3.8 -4.7 -4.7 
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2.5 Study of next-to-next-to-leading order QCD predictions for W and Z production at LHC 

It has been in 2004 that the first differential next-to-next-to-Ieading order (NNLO) QCD calculation 
for vector boson production in hadron collisions was completed by Anastasiou et al. [36]. This group 
has calculated the rapidity dependence for W and Z production at NNLO. They have shown that the 
perturbative expansion stabilizes at this order in perturbation theory and that the renormalization and 
factorization scale uncertainties are drastically reduced, down to the level of one per-cent. It is therefore 
interesting to perform a more detailed study of these NNLO predictions for various observables which 
can be measured at LHC, as well as to investigate their systematic uncertainties. 

In the study presented here we have calculated both the differential (in rapidity) and inclusive 
cross sections for W, Z and high-mass Drell-Yan (Z/7*) production. Here "inclusive" refers to the results 
obtained by integrating the differential cross sections over a rapidity range similar to the experimentally 
accessible region, which might be more relevant than the complete cross section which also includes the 
large-rapidity tails. 

Such a prediction would then be compared to the experimental measurements at LHC, which will 
allow for precise tests of the Standard Model as well as to put strong constraints on the parton distribution 
functions (PDFs) of the proton. It is clear that in the experiment only the rapidity and transverse momenta 
of the leptons from the vector boson decays will be accessible, over a finite range in phase space. In 
order to compute the rapidity of the vector boson by taking into account the finite experimental lepton 
acceptance, Monte Carlo simulations have to be employed which model vector boson production at 
the best possible precision in QCD, as for example the program MC@NLO [21]. The so computed 
acceptance corrections will include further systematic uncertainties, which are not discussed here. 

2.5.1 Parameters and analysis method 

The NNLO predictions have been implemented in the computer code VRAP [37], which has been mod- 
ified in order to include ROOT [38] support for producing ntuples, histograms and plots. The code 
allows to specify the collision energy (14 TeV in our case), the exchanged vector boson (7*, Z, Z/7*, 
W"*", W^), the scale Q of the exchanged boson (Mz, Mw or off-shell, e.g. Q = 400 GeV), the renor- 
malization and factorization scales, the invariant mass of the di-lepton system (fixed or integrated over 
a specified range), the value of the electro-magnetic coupling (oqed = 1/128 or oqedIQ)) and the 
number of light fermions considered. Regarding the choice of pdfs, the user can select a pdf set from the 
MRST2001 fits [39] or from the ALEKHIN fits [40], consistent at NNLO with variable flavour scheme. 
We have chosen the MRST2001 NNLO fit, mode 1 with as{Mz) = 0.1155 [39], as reference set. 

The program is run to compute the differential cross section da/dY , Y being the boson rapidity, 
at a fixed number of points in Y. This result is then parametrized using a spline interpolation, and the 
thus found function can be integrated over any desired rapidity range, such as \Y\ < 2,\Y\ < 2.5 or 
|y| < 3, as well as over finite bins in rapidity. For the study of on-shell production the integration range 
over the di-lepton invariant mass Mn was set to My — SFy < Mu < My + SFy, with My and Ty the 
vector boson mass and width. This simulates an experimental selection over a finite signal range. 

The systematic uncertainties have been divided into several categories: The PDF uncertainty is 
estimated by taking the maximum deviation from the reference set when using different PDFs from 
within the MRST2001 set or the ALEKHIN set. The latter difference is found to give the maximal 
variation in all of the investigated cases. The renormalization and factorization scales = /^r, i^ly have 
been varied between 0.5 < /x/Q < 2, both simultaneously as well as fixing one to fi = Q and varying the 
other. The maximum deviation from the reference setting ^ = Q is taken as uncertainty. The observed 
difference when using either a fixed or a running electro-magnetic coupling constant is also studied as 
possible systematic uncertainty due to higher-order QED effects. Since it is below the one per-cent 
level, it is not discussed further. Finally, in the case of Z production it has been checked that neglecting 

^'^Contributing author: Gunther Dissertori 
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photon exchange and interference contributions is justified in view of the much larger PDF and scale 
uncertainties. 



2. 5. 2 Results forW and Z production 
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Fig. 26: Left : Drell-Yan Z production cross section (x BR) at LHC energies, as a function of the Z rapidity, for two different 
PDF choices. Right : Zoom into a restricted rapidity region, with the ratio of the predictions for the two different PDF sets as 
lower inset. The error bars indicate the scale uncertainties. 



In Figure 26 the results for Z production at LHC are shown for two different choices of PDF set, as 
a function of the boson rapidity. It can be seen that the predictions differ by about 2% at central rapidity, 
and the difference increases to about 5% at large rapidity. A similar picture is obtained when integrating 
the differential cross section up to rapidities of 2, 2.5 and 3 (Table 7). The more of the high-rapidity 
tail is included, the larger the uncertainty due to the PDF choice. From Table 1 it can also be seen that 
the scale uncertainties are slightly below the one per-cent level. It is worth noting that the choice of the 
integration range over the di-lepton invariant mass can have a sizeable impact on the cross section. For 
example, increasing the range from the standard value to 66 GeV < Mz < 116 GeV increases the cross 
section by 8%. 



Channel 


Z prod. 


W prod. 


range 


\Y\ < 2 


|y| < 2.5 


\Y\ < 3 


\Y\ < 2 


|y| < 2.5 


\Y\ < 3 


cross section [nb] 


0.955 


1.178 


1.384 


9.388 


11.648 


13.800 


A PDF [%] 


2.44 


2.95 


3.57 


5.13 


5.47 


5.90 


A scale [%] 


0.85 


0.87 


0.90 


0.99 


1.02 


1.05 



Table 7: NNLO QCD results for W and Z production at the LHC for the integration over different rapidity ranges. Also given 
are the relative uncertainties due to the choice of the PDFs and of the renormalization and factorization scale. The numbers 
include the branching ratio Z(W) ee{ev). 



The results for W production (Table 7) have been obtained by first calculating separately the cross 
sections for W"*" and W~ production, and then adding these up. Again we observe an increase of the 
PDF uncertainty when going to larger rapidity ranges. Compared to the Z production, here the PDF 
uncertainties are larger, between 5 and 6%, whereas the scale uncertainties are of the same level, k, 1%. 
It is interesting to note that the PDF uncertainty for W~ production is about 10 - 20% (relative) lower 
than that for W+. 

A considerable reduction in systematic uncertainty can be obtained by calculating cross section 
ratios. Two options have been investigated, namely the ratios cj(W'^)/(t(W~) and cj(W)/(t(Z). As can 
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be seen from Figure 27, the PDF uncertainties are reduced to the 0.7% level in the former ratio, and to 
about 2% in the latter. The scale uncertainties are reduced to the 0.15% level in both cases. Taking such 
ratios has also the potential advantage of reduced experimental systematic uncertainties, such as those 
related to the acceptance corrections. 
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Fig. 27: Ratio of the production cross sections for W"'", W (left), and W, Z (rigfit), as a function of rapidity, for two different 
PDF sets. Tiie inserts show the ratios of the results for the two PDF choices. 



2. 5. 3 Results for high-mass Drell- Yan processes 

Similarly to on-shell W and Z production we have also analyzed the high-mass Drell- Yan process, 
namely Z/7* production at a scale of Q = 400 GeV. In this case the di-lepton invariant mass has 
been integrated over the range Mu = 400 ± 50 GeV. Here the PDF uncertainties are found between 
3.7% and 5.1% for the various integration ranges over rapidity, somewhat larger than for on-shell pro- 
duction. However, by normalizing the high-mass production cross section to the on-shell case, the PDF 
uncertainties ai^e considerably reduced, being 1.2 - 1.5%. 

The systematic uncertainties related to the renormalization and factorization scale are reduced 
(A scale 0.2%) when going to the high-mass exchange, as expected from perturbative QCD with a 
decreasing strong coupling constant. In this case a normalization of the cross section to the on-shell 
case does not give an improvement. However, since the scale uncertainties are well below the PDF 
uncertainties, this is less of an issue for the moment. 

2.5.4 Summary 

We have studied NNLO QCD predictions for W and Z production at LHC energies. We have identified 
the choice of PDF set as the dominant systematic uncertainty, being between 3 and 6%. The choice of 
the renormalization and factorization scale leads to much smaller uncertainties, at or below the 1% level. 
In particular we have shown that the systematic uncertainties can be sizeably reduced by taking ratios 
of cross sections, such as a{W+)/a{W-), a{W)/a{Z) or a{Zh*, Q = mGeV)/a{Z/j* ,Q = Mz). 
For such ratios it can be expected that also part of the experimental uncertainties cancel. With theoretical 
uncertainties from QCD at the few per-cent level the production of W and Z bosons will most likely be 
the best-known cross section at LHC. 

Concerning the next steps, it should be considered that at this level of precision it might become 
relevant to include also higher-order electro-weak corrections. In addition, since experimentally the bo- 
son rapidity will be reconstructed from the measured lepton momenta, a detailed study is needed to 
evaluate the precision at which the acceptance correction factors for the leptons from the boson de- 
cays can be obtained. For this Monte Carlo programs such as MC@NLO should be employed, which 
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combine next-to-leading-order matrix elements with parton showers and con^ectly take account of spin 
congelations. 
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3 Experimental determination of Parton Distributions 
3.1 Introduction 

With HERA cuiTently in its second stage of operation, it is possible to assess the potential precision 
limits of HERA data and to estimate the potential impact of the measurements which are expected at 
HERA-II, in particular with respect to the PDF uncertainties. 

Precision limits of the structure function analyses at HERA are examined in section 3.2. Since 
large amounts of luminosity are already collected, the systematic uncertainty becomes most important. A 
detailed study of error sources with particular emphasis on correlated errors for the upcoming precision 
analysis of the inclusive DIS cross section at low using 2000 data taken by the HI experiment is 
presented. A new tool, based on the ratio of cross sections measured by different reconstruction methods, 
is developed and its ability to qualify and unfold various coiTclated error sources is demonstrated. 

An important issue is the consistency of the HERA data. In section 3.3, the HI and ZEUS pub- 
lished PDF analyses are compared, including a discussion of the different treatments of correlated sys- 
tematic uncertainties. Differences in the data sets and the analyses are investigated by putting the HI data 
set through both PDF analyses and by putting the ZEUS and HI data sets through the same (ZEUS) anal- 
ysis, separately. Also, the HERA averaged data set (section 3.4) is put through the ZEUS PDF analysis 
and the result is compared to that obtained when putting the ZEUS and HI data sets through this analysis 
together, using both the Offset and Hessian methods of treating con^elated systematic uncertainties. 

The HERA experimental data can not only be cross checked with respect to each other but also 
combined into one common dataset, as discussed in section 3.4. In this respect, a method to combine 
measurements of the structure functions performed by several experiments in a common kinematic do- 
main is presented. This method generalises the standard averaging procedure by taking into account 
point-to-point congelations which are introduced by the systematic uncertainties of the measurements. 
The method is applied to the neutral and charged current DIS cross section data published by the HI and 
ZEUS collaborations. The averaging improves in particular the accuracy due to the cross calibration of 
the HI and ZEUS measurements. 

The flavour decomposition of the light quark sea is discussed in section 3.6. For low x and thus low 
domain at HERA only measurement of the photon exchange induced structure functions F2 and Ff, 
is possible, which is insufficient to disentangle individual quark flavours. A general strategy in this case 
is to assume flavour symmetry of the sea. Section 3.6 considers PDF uncertainties if this assumption 
is released. These uncertainties can be significantly reduced if HERA would run in deuteron-electron 
collision mode. 

The impact of projected HERA-II data on PDFs is estimated in section 3.7. In particular, next-to- 
leading order (NLO) QCD predictions for inclusive jet cross sections at the LHC centre-of-mass energy 
are presented using the estimated PDFs. A further important measurement which could improve un- 
derstanding of the gluon density at low x and, at the same time, provide consistency checks of the low 
QCD evolution is the measurement of the longitudinal structure function F^. Perspectives of this 
measurement are examined in section 3.5, while the impact of this measurement is also estimated in 
section 3.7. 

Further improvements for consistently including final-state observables in global QCD analyses 
are discussed in section 3.8. There, a method for "a posteriori" inclusion of PDFs, whereby the Monte 
Carlo run calculates a grid (in x and Q) of cross section weights that can subsequently be combined with 
an arbitrary PDF. The procedure is numerically equivalent to using an interpolated form of the PDF. The 
main novelty relative to prior work is the use of higher-order interpolation, which substantially improves 
the tradeoff between accuracy and memory use. An accuracy of about 0.01% has been reached for the 
single inclusive cross-section in the central rapidity region \y\ < 0.5 for jet transverse momenta from 100 
to SOOOGeV. This method will make it possible to consistently include measurements done at HERA, 

""^Subsection coordinators: A. Glazov, S. Moch 
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Tevatron and LHC in global QCD analyses. 



3.2 Precision Limits for HERA DIS Cross Section Measurement 

The published precision low cross section data [41] of the HI experiment became an important data 
set in various QCD fit analyses [18, 19,40,41]. Following success of these data the HI experiment plans 
to analyse a large data sample, taken during 2000 running period^', in order to reach precision limits of 
low inclusive cross sections measurements at HERA. The precision is expected to approach 1% level. 

The aim of this contribution is to calculate realistic error tables for 2000 HI data and pursue paths 
how to reach such a high precision. CoiTclated en^or sources aie studied in particular- and a new tool, 
based on the ratio of cross sections measured by different reconstruction methods, is developed. All 
eiTors, including correlated errors, are treated in the same manner as in [41]. EiTor tables are provided 
and used in QCD fit analysis, see Sec 3.7, in order to study the impact of the new data on PDFs. The new 
data are expected to reach higher precision level than [41] due to the following reasons 

• Larger data statistics - Statistical errors will decrease by factor of 1.5 — 2, compared to [41], 
depending on the kinematic region. 

• Very large Monte Carlo simulations (MC) - Due to a progress in computing a number of simulated 
events can be significantly increased in order to minimise statistical eiTor of MC, to understand 
uncorrelated errors and to estimate correlated errors more precisely. 

• During past years increasing knowledge, aniving from various HI analyses, enabled better under- 
standing of the detector and its components as well as improving quality of MC. 

• Data taking in 2000 was particularly smooth. Both HERA and HI were running at peak perfor- 
mance for HERA-I running period. 

This contribution uses existing 2000 data and MC ntuples along with the full analysis chain. It 
applies all preliminary technical work done on these data, including calibration, alignment, trigger studies 
etc. Quoted errors are assumed to be achieved in the final version of analysis yet the analysis has not 
been finalised, all the numbers in the paper are preliminary and may change in the publication. 

The uncertainties of the cross section measurement are divided into a number of different types. 
Namely, these are statistical uncertainties of the data, uncorrelated systematics and correlated systemat- 
ics. The term 'con^elated' refers to the fact that cross section measurements in kinematic bins are affected 
in a conelated way while different con^elated systematic enw sources ai^e considered uncoiTclated among 
each other. The classification of the systematic errors into types is sometimes straightforward (MC statis- 
tics is uncorrelated error source) but sometimes is rather arbitrary (radiative corrections are assumed to 
be uncoiTclated eiTor source). The main goal of this classification is to preserve coiTclation between data 
points while keeping the treatement as simple as possible. 

The cross section uncertainties depend on the method used to reconstruct event kinematics. There 
are various methods existing, involving a measurement of the scattered electron as well as of the hadronic 
finale state. In the following two of them, so called electron method and sigma method, are em- 
ployed [42]. The electron method uses only the measurement of the scattered electron, namely its energy 
and polar angle, while the sigma method uses both the scattered electron and the hadronic final state. An 
advantage of the sigma method is a proper treatment of QED radiation from the incoming beam electron 
(ISR). 

The statistical uncertainty of the data is typically 0.5-1%, depending on the kinematic region 
analysed and the definition of the kinematic bins. In the following we adapt the bin definition used 
in [41], apart from merging bins at low y which was done in the published data in order to increase 
statistics. 

The uncorrelated systematics consists from various contributions. A cross section uncertainty due 
to the Monte Carlo statistics is the one with very good potential to be minimised. In the following we 

^^Contributing authors: G. Lastovicka-Medin, A. Glazov, T. Lastovicka 
^^Data statistics will be increased further by adding data taken in year 1999. 
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Fig. 28; A scan of the cross section measurement change in % depending on a variation of (from top-left) electron energy, 
electron polar angle, hadronic final state calibration scale and noise level in LAr calorimeter (bottom-right). The sigma method 
(a) and the electron method (b) were used to reconstruct kinematics of events. 



assume 100 million simulated events to be used in analysis of 2000 data. Estimates were calculated with 
available 12 million simulated events and corresponding statistical eiTors scaled by a factor of a/100/12. 
As a result the uncertainty is very small and typically on the level of few permile. 

Additional contributions to the uncorrelated systematics are efficiencies. We assume for trigger 
efficiency 0.3% and backward tracker tracker efficiency 0.3% uncertainty. Radiative corrections are 
expected to affect the final cross section by 0.4%. 

Effect of correlated uncertainties on the cross section measurement is studied in the following 
manner. Particular source of correlated uncertainty, for instance the scattered electron energy measure- 
ment, is varied by assumed eiTor and the change of the measured cross section is quoted as the corre- 
sponding cross section measurement eiTor. An example of cross section change on various correlated 
error source is shown in Fig. 28 for bin of = 45 GeV^ and x = 0.005. The kinematics of events 
was reconstructed with the sigma method (a) and the electron method (b). Errors are calculated as so 
called standard eiTors of the mean in calculation of which the available Monte Carlo sample was split 
into nine sub-samples. It is cleai^ly seen that the cross section measurement with the sigma method in 
this kinematic bin is particularly sensitive to the electron energy measurement (top-left) and to noise 
description in LAr calorimeter (bottom-right). On the contrary, the electron polar angle measurement 
and the calibration of the hadronic final state play a little role. The electron method is mainly sensitive 
to the electron energy measurement. The importance of the systematic sources vary from bin to bin. 

There are five individual sources contributing to the correlated cross section uncertainties: 

• Uncertainties of 0.15% at Ef. = 27GeV and 1% at 7 GeV are assigned to the electron energy scale 
for the backward calorimeter. The uncertainty is treated as a linear function of interpolating 
between the results at 27 GeV and 7 GeV. 

• The uncertainty on the scattered electron polar angle measurement is 0.3 miad . The corresponding 
error on the cross section measurement is typically well below 1% but may be larger at lowest 
values of Q^. 

• The uncertainty on the hadronic energy scale comprises a number of systematic eiTor sources 
corresponding to the E—p^ decomposition: an uncertainty of the hadronic energy scale calibration 
of 2% for the central and forward calorimeter, an uncertainty of 3% for the fraction carried by 
tracks and a 5% uncertainty of the hadronic energy scale measured in backward calorimeter. 
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8: An example of the error table for C, 


)^ = 25 GeV^ for 2000 data, large Monte Carlo sample and suppressed systematic 
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compared to [1], see 


text for details. Absolute errors are shown. The table format is identical to the one published in [1] 
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Table 9: An example of the full error table for = 25 GeV^, published Hi data. The definition of kinematic bins is not 
identical to that in Table 8, some bins were merged to enlarge statistics. 

• The uncertainty on the hadronic energy scale is further affected by the subtracted noise in the 
calorimetery. The noise is described to the level of 10% and the corresponding error is propagated 
to the cross section uncertainty. The largest influence is in the low y region, which is measured 
with the sigma method. 

• The uncertainty due to the photoproduction background at large y is estimated from the normali- 
sation error of the PHOJET simulations to about 10%. At low and medium values of y < 0.5 it is 
negligible. 

The total systematic error is calculated from the quadratic summation over all sources of the un- 
correlated and correlated systematic uncertainties. The total error of the DIS cross section measurement 
is obtained from the statistical and systematical enws added in quadrature. 

An example of the full error table for kinematic bin of = 25 GeV^ is shown in Table 8. For 
a comparison the corresponding part of the published data from [41] is presented in Table 9. One can 
see that precision about 1% can be reached especially in four lowest x bins, where the electron method 
was used to reconstruct the event kinematics. The key contributions to the seen improvement in the cross 
section measurement precision are the electron energy measurement, very large Monte Carlo statistics, 
well understood noise in LAr calorimeter and precisely controlled efficiencies entering the analysis. 

Full eiTor table, covering the kinematic region of 5 < < 150 GeV^ and 0.01 < y < 0.6 was 
produced. The electron method was applied for kinematic bins at y > 0.1 while the sigma method 
otherwise. The measurement of the proton structure function F2 was simulated using fractal parametri- 
sation [43] for central values, accounting for all sources of correlated and uncorrected errors. This table 
was used to estimate effect of precise low data on the determination of proton PDFs from QCD fits. 

The fact that different kinematics reconstruction methods are affected differently by the con^elated 
systematic uncertainties may be employed as a tool to estimate these uncertainties. We define 
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Fig. 29: A scan of the cross section ratio R in bins of and y as a function of tiie hadronic final state calibration variation. 
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(2) 



to be the cross section measurement ratio, where the reduced cross section af'^ and o"^'* is mea- 
sured using the electron method and the sigma method, respectively. Kinematic bins, indexed by i, cover 
a region of the analysis phase space where both reconstruction methods are applicable for the measure- 
ment. The statistical error of Ri measurement is again evaluated by splitting the sample to a number of 
sub-samples and calculating the standard error of the mean. An example of a scan of the cross section ra- 
tio Ri dependence on the hadronic final state calibration variation in a bin of = 25 GeV^ and various 
inelasticity y is shown in Fig. 29. 

An error of a particular correlated uncertainty source j can be estimated by searching for lowest 
= — ' where summation runs over kinematic bins, cjj is the error of Ri measure- 

ment and Qj is the variation of the source j. However, since there is a number of correlated error sources 
the correct way to find correlated uncertainties is account for all of them. 

Unfolding of the correlated error sources can be linearised and directly solved by minimising the 
following function: 

1 ^p. 

(3) 



The partial derivatives for systematic source a,j are obtained from linear fits to distributions as shown 
in Fig. 29. Parameters and their respective errors are obtained by matrix inversion technique. 

The procedure was tested on available Monte Carlo sample for 2000 HI data. Half of the sample, 
six million events, was used to simulate data. Full analysis chain was applied to measure the cross section 
and thus Ri. Kinematic bins were selected according to Vo <Q^ < 60 GeV^ and 0.011 < y < 0.6, i.e. 
in the main region of the data. The results are shown in Fig. 30. Closed points correspond to unfolded 
errors of the electron energy measurement (top-left), hadronic final state calibration and noise in the LAr 
calorimeter (bottom-left). There is no sensitivity observed to the electron polar angle measurement. All 
values are within statistical errors compatible with zero, as expected. For the final analysis the statistical 
errors are expected to be approximately three times smaller due to the significantly larger statistics than 
used for the presented study. This will enable the method to gain sufficient control over systematic 
correlated errors. Apart from being able to evaluate calibration of the scattered electron and of the 
hadronic final state, it gives a very good handle on the LAr calorimeter noise. 
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Fig. 30: Errors on the electron energy measurement (top-left), hadronic scale calibration (top-right) and noise in LAr calorimeter 
(bottom-left). Open points correspond to scan in one correlated error source. Closed points show the result of complete 
unfolding, taking into account correlations. 



For a comparison, open points in Fig. 30 correspond to a scan in one correlated error source. 
Tlie statistical errors are smaller, as expected, and compatible with zero. However, the unfolding method 
is preferred since it takes into account all coiTclated error sources correctly. 

In summary, a study of the DIS cross section uncertainties realistically achievable at HERA has 
been performed. For x G 0.001 — 0.01 a precision of 1% can be reached across for a wide range of G 
5 — 150 GeV^, allowing improved estimate of W, Z production cross section in the central rapidity region 
of LHC. The accuracy of the DIS cross section measurement can be verified using different kinematic 
reconstruction methods available at the HERA collider. 
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Fig. 31: Left plot: Comparison of PDFs from ZEUS and HI analyses at — lOGeV^. Right plot: Comparison of gluon from 
ZEUS and HI analyses, at various Q^. Note that the ZEUS analysis total uncertainty includes both experimental and model 
uncertainties. 

3.3 Comparison and combination of ZEUS and HI PDF analyses 

Parton Density Function (PDF) determinations are usually global fits [18, 19,44], which use fixed target 
DIS data as well as HERA data. In such analyses the high statistics HERA NC e^p data, which span 
the range 6.3 X 10"^ < X < 0.65, 2.7 < < 30, OOOGeV^, have determined the low -X sea and gluon 
distributions, whereas the fixed target data have determined the valence distributions and the higher-x sea 
distributions. The v-Fe fixed target data have been the most important input for determining the valence 
distributions, but these data suffer from uncertainties due to heavy target corrections. Such uncertainties 
are also present for deuterium fixed target data, which have been used to determine the shape of the 
high-x d-valence quark. 

HERA data on neutral and chai^ged cuiTcnt (NC and CC) e'^p and e~p inclusive double differential 
cross-sections are now available, and have been used by both the HI and ZEUS collaborations [45,46] 
in order to determine the parton distributions functions (PDFs) using data from within a single experi- 
ment. The HERA high cross-section data can be used to determine the valence distributions, thus 
eliminating uncertainties from heavy target corrections. The PDFs are presented with full accounting 
for uncertainties from correlated systematic errors (as well as from statistical and uncorrected sources). 
Peforming an analysis within a single experiment has considerable advantages in this respect, since the 
global fits have found significant tensions between different data sets, which make a rigorous statistical 
treatment of uncertainties difficult. 

Fig. 31 compares the results of the HI and ZEUS analyses. Whereas the extracted PDFs are 
broadly compatible within errors, there is a noticeable difference in the shape of the gluon PDFs. Full 
details of the analyses are given in the relevant publications, in this contribution we examine the differ- 
ences in the two analyses, recapping only salient details. 

3. 3. 1 Comparing ZEUS and HI published PDF analyses 

The kinematics of lepton hadron scattering is described in terms of the variables Q^, the invariant mass 
of the exchanged vector boson, Bjorken x, the fraction of the momentum of the incoming nucleon taken 
by the struck quark (in the quaitparton model), and y which measures the energy transfer between the 
lepton and hadron systems. The differential cross-section for the NC process is given in terms of the 

^^Contributing authors: A. Cooper- Sarkar, C. Gwenlan 
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structure functions by 

<fa{e^p) _ 27ra2 



[Y+ F2{x, Q2) _ y2 ^^(^^ g2) ^ y. xF^ix, Q^)] , (4) 



where Y± = 1 it (1 — y)^. The structure functions F2 and xF^ are directly related to quark distributions, 
and their dependence, or scaling violation, is predicted by pQCD. At < 1000 GeV^ F2 domi- 
nates the charged lepton-hadron cross-section and for x < 10^^, F2 itself is sea quark dominated but its 
evolution is controlled by the gluon contribution, such that HERA data provide crucial information 
on low-x sea-quark and gluon distributions. At high Q^, the structure function xF^ becomes increas- 
ingly important, and gives information on valence quark distributions. The CC interactions enable us to 
separate the flavour of the valence distributions at high-x, since their (LO) cross-sections aie given by, 

-1^ = wrbik^-'' ^ " " ^ ■ 
-1^ = wrb^,^ [<" + + - ^> + ■ 

For both HERA analyses the QCD predictions for the structure functions are obtained by solving the 
DGLAP evolution equations [24-27] at NLO in the MS scheme with the renormalisation and factor- 
ization scales chosen to be Q^. These equations yield the PDFs at all values of provided they are 
input as functions of x at some input scale Qq. The resulting PDFs are then convoluted with coefficient 
functions, to give the structure functions which enter into the expressions for the cross-sections. For a 
full explanation of the relationships between DIS cross-sections, structure functions, PDFs and the QCD 
improved parton model see ref. [31]. 

The HERA data are all in a kinematic region where there is no sensitivity to target mass and 
higher twist contributions but a minimum cut must be imposed to remain in the kinematic region 
where perturbative QCD should be applicable. For ZEUS this is > 2.5 GeV^, and for HI it is 
Q"^ > 3.5 GeV^. Both collaborations have included the sensitivity to this cut as part of their model 
errors. 

In the ZEUS analysis, the PDFs for u valence, xuv{x), d valence, xdy{x), total sea, xS{x), 
the gluon, xg{x), and the difference between the d and u contributions to the sea, x{d — u), are each 
parametrized by the form 

pixP2(l -xpP(x), (5) 

where P{x) = 1 + p^x, at = TGeV^. The total sea xS = 2x{u + d + s + c + h), where q = qsea for 
each flavour, u = + Ugea, d = dy + dsea and q = qsea for all other flavours. The flavour structure of 
the light quark sea allows for the violation of the Gottfried sum rule. However, there is no information on 
the shape of the d — u distribution in a fit to HERA data alone and so this distribution has its shape fixed 
consistent with the Drell-Yan data and its normalisation consistent with the size of the Gottfried sum-rule 
violation. A suppression of the strange sea with respect to the non-strange sea of a factor of 2 at Qq, is 
also imposed consistent with neutrino induced dimuon data from CCFR. Parameters are further restricted 
as follows. The normalisation parameters, pi, for the d and u valence and for the gluon are constrained 
to impose the number sum-rules and momentum sum-rule. The p2 par ameter which constrains the low-x 
behaviour of the u and d valence distributions is set equal, since there is no information to constrain 
any difference. When fitting to HERA data alone it is also necessary to constrain the high-x sea and 
gluon shapes, because HERA-I data do not have high statistics at large-x, in the region where these 
distributions are small. The sea shape has been restricted by setting p4 = for the sea, but the gluon 
shape is constrained by including data on jet production in the PDF fit. Finally the ZEUS analysis has 
11 free PDF parameters. ZEUS have included reasonable variations of these assumptions about the 
input parametrization in their analysis of model uncertainties. The strong coupling constant was fixed to 
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as(M|) = 0.118 [47]. Full account has been taken of con^elated experimental systematic enws by the 
Offset Method, as described in ref [44,48]. 

For the HI analysis, the value of Qq = 4GeV^, and the choice of quark distributions which are 
parametrized is different. The quarks are considered as tt-type and d-type with different parametrizations 
for, xU = x{ui, + Usea + c), xD = x{dy + dsea + s), xU = x{u + c) and xD = x{d + s), with qsea = <i, 
as usual, and the the form of the quark and gluon parametrizations given by Eq. 5. For xD and xtj the 
polynomial, P(x) = 1.0, for the gluon and X-D, P(x) = (I+P43;), and for xt/, P(x) = {l+p^x+p^x'^). 
The parametrization is then further restricted as follows. Since the valence distributions must vanish as 
X — > 0, the low-x parameters, pi and p2 are set equal for xU and xtj, and for xD and xD. Since there is 
no information on the flavour structure of the sea it is also necessary to set p2 equal for xtJ and xD. The 
normaUsation, pi, of the gluon is determined from the momentum sum-rule and the p4 parameters for 
xU and xD are determined from the valence number sum-rules. Assuming that the strange and charm 
quark distributions can be expressed as x independent fractions, fs and /c, of the d and u type sea, gives 
the further constraint pi(i7) = pi{D){l — fs) / {1 — fc)- Finally there are 10 free parameters. HI has also 
included reasonable variations of these assumptions in their analysis of model uncertainties. The strong 
coupling constant was fixed to as{M^) = 0.1185 and this is sufficiently similar to the ZEUS choice 
that we can rule it out as a cause of any significant difference. Full account has been taken of conelated 
experimental systematic eiTors by the Hessian Method, see ref. [48]. 

For the ZEUS analysis, the heavy quark production scheme used is the general mass variable 
flavour number scheme of Roberts and Thome [49]. For the HI analysis, the zero mass variable flavour 
number scheme is used. It is well known that these choices have a small effect on the steepness of the 
gluon at very small-x, such that the zero-mass choice produces a slightly less steep gluon. However, 
there is no effect on the more striking differences in the gluon shapes at larger x. 

There are two differences in the analyses which are worth further investigation. The different 
choices for the form of the PDF parametrization at Qq and the different treatment of the con^elated 
experimental uncertainties. 

3.3.2 Comparing different PDF analyses of the same data set and comparing different data sets using 
the same PDF analysis. 

So far we have compared the results of putting two different data sets into two different analyses. Because 
there are many differences in the assumptions going into these analyses it is instructive to consider: (i) 
putting both data sets through the same analysis and (ii) putting one of the data sets through both analyses. 
For these comparisons, the ZEUS analysis does NOT include the jet data, so that the data sets are more 
directly comparable, involving just the inclusive double differential cross-section data. Fig. 32 compares 
the sea and gluon PDFs, at = lOGeV^, extracted from HI data using the HI PDF analysis with 
those extracted from HI data using the ZEUS PDF analysis. These alternative analyses of the same data 
set give results which are compatible within the model dependence error bands. Fig. 32 also compares 
the sea and gluon PDFs extracted from ZEUS data using the ZEUS analysis with those extracted from 
HI data using the ZEUS analysis. From this comparison we can see that the different data sets lead to 
somewhat different gluon shapes even when put through exactly the same analysis. Hence the most of 
the difference in shape of the ZEUS and HI PDF analyses can be traced back to a difference at the level 
of the data sets. 

3.3.3 Comparing the Offset and Hessian method of assessing correlated experimental uncertainties 

Before going further it is useful to discuss the treatment of coiTclated systematic errors in the ZEUS and 
HI analyses. A full discussion of the treatment of correlated systematic errors in PDF analyses is given in 
ref [31], only salient details are recapped here. Traditionally, experimental collaborations have evaluated 
an overall systematic uncertainty on each data point and these have been treated as uncorrelated, such that 
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Fig. 32: Sea and gluon distributions at = lOGeV^ extracted from different data sets and different analyses. Left plot: HI 
data put through both ZEUS and HI analyses. Middle plot: ZEUS data put through ZEUS analysis. Right plot: HI data put 
through ZEUS analysis. 



they are simply added to the statistical uncertainties in quadrature when evaluating x^. However, modem 
deep inelastic scattering experiments have very small statistical uncertainties, so that the contribution of 
systematic uncertainties becomes dominant and consideration of point to point correlations between 
systematic uncertainties is essential. 

For both ZEUS and HI analyses the formulation of the including correlated systematic uncer- 
tainties is constructed as follows. The correlated uncertainties are included in the theoretical prediction, 
Fi{p,s), such that 

A 

where, F^^^^^^{p), represents the prediction fromNLO QCD in terms of the theoretical parameters p, 
and the parameters sx represent independent variables for each source of systematic uncertainty. They 
have zero mean and unit variance by construction. The symbol A^^*^ represents the one standard deviation 
correlated systematic eiTor on data point i due to correlated eiTor source A. The is then formulated as 

2 _ [Fijp, s) - Fi{meas)f 2 ... 

X - 2^ + 2^ w 

where, Fj(meas), represents a measured data point and the symbol cr, represents the one standard devia- 
tion uncorrected eiTor on data point i, from both statistical and systematic sources. The experiments use 
this in different ways. ZEUS uses the Offset method and HI uses the Hessian method. 

Traditionally, experimentalists have used 'Offset' methods to account for correlated systematic 
errors. The is formluated without any terms due to correlated systematic errors (sx = in Eq. 6) 
for evaluation of the central values of the fit pai^ameters. However, the data points are then offset to 
account for each source of systematic eiTor in turn (i.e. set sa = +1 and then sx = —I for each source 
A) and a new fit is performed for each of these variations. The resulting deviations of the theoretical 
parameters from their central values are added in quadrature. (Positive and negative deviations are added 
in quadrature sepai^ately.) This method does not assume that the systematic uncertainties are Gaussian 
distributed. An equivalent (and much more efficient) procedure to perform the Offset method has been 
given by Pascaud and Zomer [50], and this is what is actually used. The Offset method is a conservative 
method of error estimation as compared to the Hessian method. It gives fitted theoretical predictions 
which are as close as possible to the central values of the pubUshed data. It does not use the full statistical 
power of the fit to improve the estimates of sx, since it choses to mistrust the systematic error estimates, 
but it is coiTcspondingly more robust. 

The Hessian method is an alternative procedure in which the systematic uncertainty parameters sx 
are allowed to vary in the main fit when determining the values of the theoretical parameters. Effectively, 
the theoretical prediction is not fitted to the central values of the published experimental data, but these 
data points are allowed to move collectively, according to their correlated systematic uncertainties. The 
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Fig. 33: PDFs at — lOGeV^, for the ZEUS analysis of ZEUS data performed by the Offset and the Hessian methods. 

theoretical prediction determines the optimal settings for correlated systematic shifts of experimental data 
points such that the most consistent fit to all data sets is obtained. Thus, in a global fit, systematic shifts 
in one experiment are correlated to those in another experiment by the fit. In essence one is allowing 
the theory to calibrate the detectors. This requires great confidence in the theory, but more significantly, 
it requires confidence in the many model choices which go into setting the boundary conditions for the 
theory (such as the parametrization at Qq). 

The ZEUS analysis can be performed using the Hessian method as well as the Offset method and 
Fig. 33 compares the PDFs, and their uncertainties, extracted from ZEUS data using these two methods. 
The central values of the different methods are in good agreement but the use of the Hessian method 
results in smaller uncertainties, for a the standard set of model assumptions, since the input data can 
be shifted within their correlated systematic uncertainties to suit the theory better. However, model un- 
certainties are more significant for the Hessian method than for the Offset method. The experimental 
uncertainty band for any one set of model choices is set by the usual tolerance, Ax^ = 1, but the 
acceptability of a different set of choices is judged by the hypothesis testing criterion, such that the 
should be approximately in the range N it y/{2N), where N is the number of degrees of freedom. The 
PDF parameters obtained for the different model choices can differ by much more than their experimen- 
tal uncertainties, because each model choice can result in somewhat different values of the systematic 
uncertainty parameters, s\, and thus a different estimate of the shifted positions of the data points. This 
results in a larger spread of model uncertainty than in the Offset method, for which the data points can- 
not move. Fig 31 illustrates the comparability of the ZEUS (Offset) total uncertainty estimate to the HI 
(Hessian) experimental plus model uncertainty estimate. 

Another issue which arises in relation to the Hessian method is that the data points should not be 
shifted far outside their one standard deviation systematic uncertainties. This can indicate inconsistencies 
between data sets, or parts of data sets, with respect to the rest of the data. The CTEQ collaboration have 
considered data inconsistencies in their most recent global fit [18]. They use the Hessian method but 
they increase the resulting uncertainty estimates, by increasing the tolerance to Ax^ = 100, to allow 
for both model uncertainties and data inconsistencies. In setting this tolerance they have considered 
the distances from the x^-minima of individual data sets to the global minimum for all data sets. These 
distances by far exceed the range allowed by the Ax^ = 1 criterion. Strictly speaking such variations can 
indicate that data sets ai^e inconsistent but the CTEQ collaboration take the view that all of the current 
world data sets must be considered acceptable and compatible at some level, even if strict statistical 
criteria are not met, since the conditions for the application of strict criteria, namely Gaussian error 
distributions, are also not met. It is not possible to simply drop "inconsistent" data sets, as then the 
partons in some regions would lose important constraints. On the other hand the level of "inconsistency" 
should be reflected in the uncertainties of the PDFs. This is achieved by raising the x^ tolerance. This 
results in uncertainty estimates which are comparable to those achieved by using the Offset method [48]. 
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Fig. 34: Top plots: Sea and gluon distributions at = lOGeV^ extracted from HI and ZEUS data using the ZEUS analysis 
(left) compared to those extracted from ZEUS data alone using the ZEUS analysis (right). Bottom Plots: Valence distributions 
at = lOGeV^, extracted from HI and ZEUS data using the ZEUS analysis (left) compared to those extracted from ZEUS 
data alone using the ZEUS analysis (right). 

3.3.4 Using both HI and ZEUS data in the same PDF analysis 

Using data from a single experiment avoids questions of data consistency, but to get the most information 
from HERA it is necessary to put ZEUS and HI data sets into the same analysis together, and then 
questions of consistency arise. Fig 34 compares the sea and gluon PDFs and the u and d valence PDFs 
extracted from the ZEUS PDF analysis of ZEUS data alone, to those extracted from the ZEUS PDF 
analysis of both HI and ZEUS data. It is noticeable that, for the low-x sea and gluon PDFs, combining 
the data sets does not bring a reduction in uncertainty equivalent to doubling the statistics. This is 
because the data which determine these PDFs are systematics limited. In fact there is some degree of 
tension between the ZEUS and the HI data sets, such that the per degree of freedom rises for both 
data sets when they are fitted together. The Offset method of treating the systematic errors reflects this 
tension such that the overall uncertainty is not much improved when HI data are added to ZEUS data. 
However, the uncertainty on the high-x valence distributions is reduced by the input of HI data, since 
the data are still statistics limited at high x. 

3.3.5 Combining the HI and ZEUS data sets before PDF analysis 

Thus there could be an advantage in combining ZEUS and HI data in a PDF fit if the tension between the 
data sets could be resolved. It is in this context the question of combining these data into a single data set 
arises. The procedure for combination is detailed in the contribution of S. Glazov to these proceedings 
(section 3.4). Essentially, since ZEUS and HI are measuring the same physics in the same kinematic 
region, one can try to combine them using a 'theory-free' Hessian fit in which the only assumption is 
that there is a true value of the cross-section, for each process, at each x, point. The systematic 
uncertainty parameters, s\, of each experiment are fitted to determine the best fit to this assumption. 
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Fig. 35: Left plot: Sea and gluon distributions at = lOGeV^, extracted from tlie combined HI and ZEUS data set using the 
ZEUS analysis. Right plot: Valence distributions at — lOGeV^, extracted from the combined HI and ZEUS data set using 
the ZEUS analysis. 




Fig. 36: Left plot: Sea and gluon distributions at = lOGeV^, extracted from the HI and ZEUS data sets using the ZEUS 
analysis done by Hessian method. Right plot: Valence distributions at = lOGeV^, extracted from the HI and ZEUS data 
sets using the ZEUS analysis done by Hessian method. 

Thus each experiment is caUbrated to the other. This works well because the sources of systematic 
uncertainty in each experiment are rather different. Once the procedure has been performed the resulting 
systematic uncertainties on each of the combined data points are significantly smaller than the statistical 
errors. Thus one can legitimately make a fit to the combined data set in which these statistical and 
systematic uncertainties are simply combined in quadrature. The result of making such a fit, using the 
ZEUS analysis, is shown in Fig. 35. The central values of the ZEUS and HI published analyses are also 
shown for comparison. Looking back to Fig. 34 one can see that there has been a dramatic reduction 
in the level of uncertainty compared to the ZEUS Offset method fit to the separate ZEUS and HI data 
sets. This result is very promising. A preliminary study of model dependence, varying the form of 
the polynomial, P{x), used in the PDF paremtrizations at Qq, also indicates that model dependence is 
relatively small. 

The tension between ZEUS and HI data could have been resolved by putting them both into a PDF 
fit using the Hessian method to shift the data points. That is, rather than calibrating the two experiments to 
each other in the 'theory-free' fit, we could have used the theory of pQCD to calibrate each experiment. 
Fig. 36 shows the PDFs extracted when the ZEUS and HI data sets are put through the ZEUS PDF 
analysis procedure using the Hessian method. The uncertainties on the resulting PDFs are comparable to 
those found for the fit to the combined data set, see Fig. 35. However, the central values of the resulting 
PDFs ai^e rather different- particularly for the less well known gluon and d valence PDFs. For both of 
the fits shown in Figs. 35, 36 the values of the systematic error parameters, sx, for each experiment have 
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Syatematic uncertainty sx in PDF fit in Theory-free fit 



ZEUS electron efficiency 


1.68 


0.31 


ZEUS electron angle 


-1.26 


-0.11 


ZEUS electron energy scale 


-1.04 


0.97 


ZEUS hadron calorimeter energy scale 


1.05 


-0.58 


HI electron energy scale 


-0.51 


0.61 


HI hadron energy scale 


-0.26 


-0.98 


HI calorimeter noise 


1.00 


-0.63 


HI photoproduction background 


-0.36 


0.97 



Table 10: Systematic shifts for ZEUS and HI data as determine by a jointpQCD PDF fit, and as determined by the theory-free 
data combination fit 

been allowed to float so that the data points are shifted to give a better fit to our assumptions, but the 
values of the systematic error parameters chosen by the 'theory-free' fit and by the PDF fit are rather 
different. A representaive sample of these values is given in Table 10. These discrepancies might be 
somewhat alleviated by a full consideration of model enws in the PDF fit, or of appropriate tolerance 
when combining the ZEUS and HI experiments in a PDF fit, but these differences should make us wary 
about the uncritical use of the Hessian method. 
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3.4 Averaging of DIS Cross Section Data 

The QCD fit procedures (Alekhin [40], CTEQ [18], MRST [19], HI [46], ZEUS [44]) use data from 
a number of individual experiments directly to extract the parton distribution functions (PDF). All pro- 
grams use both the central values of measured cross section data as well as information about the corre- 
lations among the experimental data points. 

The direct extraction procedure has several shortcomings. The number of input datasets is large 
containing several individual publications. The data points are correlated because of common systematic 
uncertainties, within and also across the publications. Handling of the experimental data without addi- 
tional expert knowledge becomes difficult. Additionally, as it is discussed in Sec. 3.3, the treatment of 
the correlations produced by the systematic errors is not unique. In the Lagrange Multiplier method [50] 
each systematic error is treated as a parameter and thus fitted to QCD. Error propogation is then used 
to estimate resulting uncertainties on PDFs. In the so-called "offset" method (see e.g. [44]) the datasets 
ai^e shifted in turn by each systematic eiTor before fitting. The resulting fits are used to form an envelope 
function to estimate the PDF uncertainty. Each method has its own advantages and shortcomings, and it 
is difficult to select the standard one. Finally, some global QCD analyses use non-statistical criteria to 
estimate the PDF uncertainties (Ax^ ^ 1). This is driven by the apparent discrepancy between different 
experiments which is often difficult to quantify. Without a model independent consistency check of the 
data it might be the only safe procedure. 

These drawbacks can be significantly reduced by averaging of the input structure function data 
in a model independent way before performing a QCD analysis of that data. One combined dataset 
of deep inelastic scattering (DIS) cross section measurements is much easier to handle compai^ed to a 
scattered set of individual experimental measurements, while retaining the full correlations between data 
points. The averaging method proposed here is unique and removes the drawback of the offset method, 
which fixes the size of the systematic uncertainties. In the averaging procedure the coiTclated systematic 
uncertainties are floated coherently allowing in some cases reduction of the uncertainty. In addition, study 
of a global /dof of the average and distribution of the pulls allows a model independent consistency 
check between the experiments. In case of discrepancy between the input datasets, localised enlargement 
of the uncertainties for the average can be performed. 

A standard way to represent a cross section measurement of a single experiment is given in the 
case of the F2 structure function by: 



X. 



rpi,true / pi i 



fpi.fruel r V- L 2 A^j daj^3j\ „ 



«1 (7) 



Here F| (af) are the measured central values (statistical and uncon^elated systematic uncertainties) of 
the F2 structure function^", aj are the coiTclated systematic uncertainty sources and /9aj are the 
sensitivities of the measurements to these systematic sources. Eq. 7 corresponds to the correlated prob- 
ability distribution functions for the structure function i?^'**""*^ and for the systematic uncertainties aj. 
Eq. 7 resembles Eq. 6 where the theoretical predictions for F2 are substituted by p^'^^'^^_ 

The function Eq. 7 by construction has a minimum = for i?^'*^"*^ = and Uj = 0. One 
can show that the total uncertainty for p^'*™*^ determined from the formal minimisation of Eq. 7 is equal 
to the sum in quadrature of the statistical and systematic uncertainties. The reduced covariance matrix 
00^(^2'*'^"'^, F^'*'"*^) quantifies the correlation between experimental points. 

In the analysis of data from more than one experiment, the Xtot function is taken as a sum of the 
functions Eq. 7 for each experiment. The QCD fit is then performed in terms of parton density functions 
which are used to calculate predictions for ^2'**^"*^. 

^"Contributing author: A. Glazov 

''"The structure function is measured for different (four momentum transfer squared) and Bjorken-x values which are 
omitted here for simplicity. 
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Fig. 37: dependence of the NC reduced cross section for x — 0.002 and x — 0.25 bins. HI data is shown as open circles, 
ZEUS data is shown as open squares and the average of HI and ZEUS data is shown as filled circles. The line represents the 
expectation from the HI PDF 2000 QCD fit. 



Before performing the QCD fit, tlie Xtot function can be minimised with respect to p^^"^"^^ and 
Uj. If none of correlated sources is present, this minimisation is equivalent to taking an average of the 
structure function measurements. If the systematic sources are included, the minimisation corresponds 
to a generalisation of the averaging procedure which contains correlations among the measurements. 

Being a sum of positive definite quadratic functions, Xtot ^1^° ^ positive definite quadratic and 
thus has a unique minimum which can be found as a solution of a system of linear equations. Although 
this system of the equations has a large dimension it has a simple structure allowing fast and precise 
solution. 

A dedicated program has been developed to perform this averaging of the DIS cross section data 
(http://www.desy.de/~glazov/f2av.tar.gz). This program can calculate the simultane- 
ous averages for neutral cuiTcnt (NC) and charged cuiTcnt (CC) electron- and positron-proton scattering 
cross section data including correlated systematic sources. The output of the program includes the cen- 
tral values and uncorrected uncertainties of the average cross section data. The correlated systematic 
uncertainties can be represented in terms of (i) covariance matrix, (ii) dependence of the average cross 
section on the original systematic sources together with the correlation matrix for the systematic sources, 
(iii) and finally the coiTclation matrix of the systematic sources can be diagonalised, in this case the form 
of for the average data is identical to Eq. 7 but the original systematic sources are not preserved. 

The first application of the averaging program has been a determination of the average of the 
published HI and ZEUS data [41,46,51-58]. Nine individual NC and CC cross section measurements 
are included from HI and seven are included from ZEUS. Several sources of systematic uncertainties are 
correlated between datasets, the correlations among HI and ZEUS datasets are taken from [46] and [45], 
respectively. No correlations are assumed between HI and ZEUS systematic uncertainties apart from a 
common 0.5% luminosity measurement uncertainty. The total number of data points is 1153 (552 unique 
points) and the number of coiTclated systematic sources, including normalisation uncertainties, is 43. 

The averaging can take place only if most of the data from the experiments are quoted at the same 
and x values. Therefore, before the averaging the data points are interpolated to a common , x 
grid. This interpolation is based on the HI PDF 2000 QCD fit [46]. The interpolation of data points in 
principle introduces a model dependency. For HI and ZEUS structure function data both experiments 
employ rather similar Q^, x grids. About 20% of the input points are interpolated, for most of the cases 



55 



the correction factors are small (few percent) and stable if different QCD fit parametrizations [18, 19] are 
used. 

The cross section data have also been corrected to a fixed center of mass energy squared S = 
101570 GeV^. This has introduced a small correction for the data taken at 5 = 90530 GeV^. The 
correction is based on HI -2000 PDFs, it is only significant for high inelasticity y > 0.6 and does not 
exceed 6%. 

The HERA data sets agree very well: x^/dof for the average is 521/601. The distribution of 
pulls does not show any significant tensions across the kinematic plane. Some systematic trends can 
be observed at low < 50 GeV^, where ZEUS NC data lie systematically higher than the HI data, 
although this difference is within the normalisation uncertainty. An example of the resulting average DIS 
cross section is shown in Fig. 37, where the data points are displaced in for clarity. 

A remarkable side feature of the averaging is a significant reduction of the correlated systematic 
uncertainties. For example the uncertainty on the scattered electron energy measurement in the HI back- 
ward calorimeter is reduced by a factor of three. The reduction of the correlated systematic uncertainties 
thus leads to a significant reduction of the total eiTors, especially for low < 100 GeV^, where sys- 
tematic uncertainties limit the measurement accuracy. For this domain the total errors are often reduced 
by a factor two compared to the total errors of the individual HI and ZEUS measurements. 

The reduction of the correlated systematic uncertainties is achieved since the dependence of the 
measured cross section on the systematic sources is significantly different between HI and ZEUS exper- 
iments. This difference is due mostly to the difference in the kinematic reconstruction methods used by 
the two collaborations, and to a lesser extent to the individual features of the HI and ZEUS detectors. 
For example, the cross section dependence on the scattered electron energy scale has a very particular 
behaviour for HI data which relies on kinematic reconstruction using only the scattered electron in one 
region of phase space. ZEUS uses the double angle reconstruction method where the pattern of this 
dependence is completely different leading to a measurement constraint. 

In summary, a generalised averaging procedure to include point-to-point correlations caused by 
the systematic uncertainties has been developed. This averaging procedure has been applied to HI and 
ZEUS DIS cross section data. The data show good consistency. The averaging of HI and ZEUS data 
leads to a significant reduction of the correlated systematic uncertainties and thus a large improvement in 
precision for low Q"^ measurements. The goal of the averaging procedure is to obtain HERA DIS cross 
section set which takes into account all correlations among the experiments. 
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3.5 The longitudinal structure function Fl 

3.5.1 Introduction 

At low X the sea quarks are determined by the accurate data on F2{x,Q'^) . The charm contribution to 
F2 is directly measured while there is no sepai^ation of up and down quarks at low x which ai^e assumed 
to have the same momentum distribution, see Sect. 3.6. Within this assumption, and setting the strange 
sea to be a fraction of the up/down sea, the proton quark content at low x is determined. The gluon 
distribution xg{x,Q^) , however, is determined only by the derivative dF2/d\nQ^ which is not well 
measured [41]. It is thus not surprising that rather different gluon distributions ai^e obtained in global 
NLO analyses, as is illustrated in Figure 38. The figure displays the result of recent fits by MRST and 
CTEQ on the gluon distribution at low and high Q^. It can be seen that there are striking differences at 
the initial scale, = 5 GeV^, which at high get much reduced due to the evolution mechanism. The 
ratio of these distributions, however, exhibits differences at lower x at the level of 10% even in the LHC 
Higgs and W production kinematic range, see Figure 39. One also observes a striking problem at large 
X which is beyond the scope of this note, however. In a recent QCD analysis it was observed [41] that 
the dependence of the gluon distribution at low x, xg oc x^"^, is correlated to the value of as(M|) , see 
Figure 40. 
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Fig. 38: Gluon momentum distributions determined by MRST and CTEQ in NLO QCD, as a function of x for = 5 GeV'^, 
close to the initial scale of the fits, and at higher as the result of the DGLAP evolution. 



In the Quark-Parton Model the longitudinal structure function Fl{x, Q^) is zero [59]. In DGLAP 
QCD, to lowest order, Fl is given by [60] 



FU.,Q^) = ^x^l ^ 



(8) 



with contributions from quarks and from gluons. Approximately this equation can be solved [61] and the 
gluon distribution appears as a measurable quantity. 



xg{x) = 1.8[^Fl(0.4x) - F2(0.8x] ~ —Fl, 



(9) 



determined by measurements of F2 and Fl . Since Fl , at low x, is not much smaller than F2 , to a good 
approximation Fl is a direct measure for the gluon distribution at low x. 

Apart from providing a very useful constraint to the determination of the gluon distribution, see 
also Sect. 3.7, a measurement of Fl{x, Q'^) is of principal theoretical interest. It provides a crucial test 
of QCD to high orders. A significant departure of an Fl measurement from the prediction which is 



""Contributing authors; J. Feltesse, M. Klein 
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based on the measurement of F2{x,Q'^) and dF2/dlnQ'^ only, would require theory to be modified. 
There are known reasons as to why the theoretical description of gluon radiation at low x may differ 
from conventional DGLAP evolution: the neglect of ln(l/x), in contrast to BFKL evolution, or the 
importance of NLL resummation effects on the gluon splitting function (see Sect. 5.3). Furthermore 
recent calculations of deep inelastic scattering to NNLO predict very large effects from the highest order 
on Fl contrary to F2 [62]. 

Within the framework of the colour dipole model there exists a testable prediction for Fl{x,Q^) , 
and the longitudinal structure function, unlike F2 , may be subject to large higher twist effects [63]. 
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Fig. 40: Correlation of the low x behaviour of the gluon distribution, characterised by the power x"'" , with the strong coupling 
constant as obtained in the HI NLO QCD fit to HI and BCDMS data. 




3.5.2 Indirect Determinations of Fl at Low x 

So far first estimates on Fi{x, Q^) at low x have been obtained by the HI Collaboration. These result 
from data on the inclusive ep — > eX scattering cross section 

'^^'^ [F2{x,Q^)-f{y)-FL{x,Q')]=ar (10) 



27ra2y+ dxdQ^ 
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obtained at fixed, lai^ge energy, s = AEeEp. The cross section is defined by the two proton structure 
functions, F2 and Fl , with y+ = 1 + (1 — y)^ and f{y) = y"^ /Yj^. At fixed s the inelasticity y is 
fixed by x and a.s y = Q'^/sx. Thus one can only measure a combination F2 — f{y)FL. Since 
HERA accesses a large range of y, and f{y) is large only at large y > 0.4, assumptions have been 
made on Fl to extract F2 at larger y. Since the cross section measurement accuracy has reached the 
few per cent level [41], the effect of the Fl assumption on F2 at lowest x has been non-negligible. The 
determination of F2{x, Q^) has thus been restricted to a region in which y < 0.6. The proton structure 
function F2{x, Q"^) is known over a few orders of magnitude in x rather well, from HERA and at largest 
x from fixed target data. Thus HI did interpret the cross section at higher y as a determination of 
Fl{x,Q'^) imposing assumptions about the behaviour of F2{x,Q'^) at lowest x. These were derived 
from QCD fits to the HI data [64] or at lower Q^, where QCD could not be trusted, from the derivative 



a new 



of F2 [65]. Recently, with the established x behaviour [66] of F2{x,Q ) = c{Q )x 
method [65] has been used to determine F^ . This "shape method" is based on the observation that the 
shape of Gr, Eq. 10, at high y is driven by / oc and sensitivity to F^ is restricted to a very nan^ow 
range of x corresponding to y = 0.3 — 0.9. Assuming that Fl{x, Q"^) in this range, for each bin in 



Q , does not depend on x, one obtains a simple relation, Ur 



cx 



fFi. which has been used to 



determine Fi{x, Q ) . Figure 41 shows the existing, preliminary data on Fi{x, Q ) at low Q from the 



_ Q'=0.75 GeV 


■ Q'=^.35CeV 

-W; 


■ Q'=2.2 GeV 






' 


■ ' 


' 


Q'=4.2 GeV 


■ \_Q'=7,5 GeV" 


10"^ 10"'' 

• HI svtxOO 
■ H1 mb99 
i H1 96/97 

NLO a, fit (H1) 

NLO f!t (ZEUS) 

NLO MRST2001 

NLO (Alekhin) 







Fig. 41: Data on the longitudinal structure function obtained using assumptions on the behaviour of the other structure function 
F2 in comparison with NLO QCD fit predictions. The data labeled svtxOO and mb99 data are preliminary. 

HI Collaboration in comparison with predictions from NLO DGLAP QCD fits to HERA and further 
cross section data. One can see that the accuracy and the x range of these Fl{x, Q^) determinations are 
rather limited although the data have some discriminative power already. 



3.5.3 Backgrounds and Accuracy 

The longitudinal structure function contribution to (7^ represents a small correction of the cross section in 
a small part of the kinematic range only. The demands for the Fl measurement are extremely high: the 
cross section needs to be measured at the per cent level and the scattered electron be uniquely identified 
up to high y. The method of unfolding F2 and Fl consists in a measurement of fx,. at fixed x and with 
varying s. This allows both structure functions to be determined from a straight line variation of ar as a 
function of f{y), see [67]. 

At lai^ge y, comesponding to low x, and low the scattering kinematics at HERA resembles 
that of a fixed target scattering experiment: the electron scattered off quarks at very low x ("at rest") is 
going in the backward detector region, i.e. in the direction of the electron beam. The scattered electron 
is accompanied by part of the hadronic final state which is related to the struck quark. High inelasticities 
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y ~ 1 — E'^/Ee demand to identify scattered electrons down to a few GeV of energy E'^. Thus a 
considerable background is to be isolated and removed which stems from hadrons or photons, from the 
ttq — > 77 decay. These particles may originate both from a genuine DIS event but to a larger extent 
stem from photoproduction processes, in which the scattered electron escapes mostly non recognised in 
electron beam direction. Removal of this background in HI is possible by requiring a track associated 
to the Spacal cluster, which rejects photons, and by measuring its charge which on a statistical basis 
removes the remaining part of the background as was demonstrated before [41, 65]. 

The scattered electron kinematics, E'^ and 9^, can be accurately reconstructed using the high res- 
olution Spacal calorimeter energy determination and the track measurements in the Backward Silicon 
Tracker (BST) and the Central Jet Drift Chamber (CJC). Reconstruction of the hadronic final state al- 
lows the energy momentum constraint to be imposed, using the "E — pz" cut, which removes radiative 
corrections, and the Spacal energy scale to be calibrated at large E'^ using the double angle method. At 
low energies E'^ the Spacal energy scale can be calibrated to a few % using the ttq mass constraint and be 
cross checked with the BST momentum measurement and with QED Compton events. The luminosity 
is measured to 1-2%. Any common normalisation uncertainty may be removed, or further constrained, 
by comparing cross section data at very low y where the contribution of is negligible. 

Subsequently two case studies are presented which illustrate the potential of measuring Fl directly 
in unfolding it from the large F2 contribution to the cross section, a study using a set of 3 low proton beam 
energies and a simulation for just one low Ep data set combined with standard 920 GeV data. Both 
studies use essentially the same correlated systematic errors and differ slightly in the assumptions on the 
background and efficiency uncertainties which regard the eiTors on cross section ratios. The following 
assumptions on the con^elated systematics are used: 5E'^/E'^ = 0.003 at large E^. linearly rising to 0.03 
at 3 GeV; 69e = 0.2mrad in the BST acceptance region and 1 mrad at larger angles; 5Eh/Eh = 0.02. 
These and further assumed systematic uncertainties represent about the state of analysis reached so far 
in inclusive low cross section measurements of HI. 

3.5.4 Simulation Results 

A simulation has been performed for Ef, = 27.6 GeV and for four different proton beam energies, 
Ep = 920, 575, 465 and 400 GeV assuming luminosities of 10, 5, 3 and 2pb~^ , respectively. The beam 
energies are chosen such that the cross section data ai^e equidistant in f{y). If the luminosity scales as 
expected as E^, the low Ep luminosities are equivalent to 35 pb^^ at standard HERA settings. Further 
systematic eiTors regard the residual radiative conections, assumed to be 0.5%, and the photoproduction 
background, 1-2% depending on y. This assumption on the background demands an improvement by a 
factor of about two at high y which can be expected from a high statistics subtraction of background using 
the charge assignment of the electron scattering candidate. An extra uncon^elated efficiency coiTcction is 
assumed of 0.5%. The resulting cross section measurements are accurate to 1-2%. For each and x 
point this choice provides up to four cross section measurements. The two structure functions are then 
obtained from a fit to ar = F2 + f{y)FL taking into account the correlated systematics. This separation 
provides also accurate data of F2, independently of Fl . The simulated data on Fl span nearly one order 
of magnitude in x and ai^e shown in Figure 42. For the chosen luminosity the statistical and systematic 
errors on Fl are of similar size. The overall accuracy on Fl{x,Q'^) , which may be obtained according 
to the assumed experimental uncertainties, is thus estimated to be of the order of 10-20%. 

Based on recent information about aspects of the machine conditions in a low proton beam energy 
mode, a further case study was performed [68] for only one reduced proton beam energy. In this simula- 
tion, for the standard electron beam energy of Ef. = 27.6 GeV, proton beam energies of Ep = 920 and 
460 GeV were chosen with luminosities of 30 and \Qph~^, respectively. According to [69] it would take 
about three weeks to change the configuration of the machine and to tune the luminosity plus 10 weeks to 
record 10 pb^^ of good data with High Voltage of trackers on. Uncertainties besides the con^elated errors 
specified above are assumed for photo-production background subtraction varying from 0% at y=0.65 to 
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Fig. 42: Simulated measurement of the longitudinal structure function Fl{x, Q^) using the HI backward apparatus to recon- 
struct the scattered electron up to maximum inelasticities of y = 0.9 corresponding to a mimimum electron energy of E'^ of 
about 3 GeV. The inner error bar is the statistical error. The full error bar denotes the statistical and systematic uncertainty 
added in quadrature. 



4% at y = 0.9, and of 0.5% for the residual radiative corrections. An overall uncertainty of 1% is assumed 
on the measurement of the cross section at low beam energy settings, which covers relative uncertainties 
on electron identification, trigger efficiency, vertex efficiency, and relative luminosity. 

To evaluate the errors two independent methods have been considered an analytic calculation and a 
fast Monte-Carlo simulation technique. The two methods provide statistical and systematic eiTors which 
are in excellent agreement. The overall result of this simulation of Fl is displayed in Figure 43. In 
many bins the overall precision on Fl{x, Q"^) is around or below 20%. It is remarkable that the overall 
precision would stay below 25% even if the statistical error or the lai^ger source of systematic uncertainty 
would turn out to be twice larger than assumed to be in this study. 





Fig. 43: Simulated measurement of the longitudinal structure function Fl{x, Q^) for data at 920 GeV (30 ph ^) and 460 GeV 
(10p6~^). The inner error bar is the statistical error. The full error bar denotes the statistical and systematic uncertainty added 
in quadrature. 
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3.5.5 Summary 

It has been demonstrated with two detailed studies that a direct measurement of the longitudinal structure 
function Fl{x, Q"^) may be performed at HERA at the five sigma level of accuracy, in the x range from 
10~^ to 10"'^ in four bins of Q^. This measurement requires about thi^ee months of running and tuning 
time at reduced proton beam energy. In addition it would provide the first measurement of the diffractive 
longitudinal structure function at the three sigma level (see the contribution of P. Newman in the summary 
of Working Group 4). The exact choice of the parameters of such a measurement are subject to further 
studies. In conclusion an accurate measurement of Fl{x, Q'^) is feasible, it requires efficient detectors, 
dedicated beam time and analysis skills. It would be the right sign of completion to have measured 
F2 first, in 1992 and onwards, and to conclude the HERA data taking with a dedicated measurement of 
the second important structure function Q^) , which is related to the gluon density in the low x 

range of the LHC. 
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Fig. 44: Determination of tiie sum of up, anti-up, down and anti-downquark distributions and of thie gluon distribution in the 
proton based on the HI neutral and charged current cross section data. Left: for of 10 and 1000 GeV^ compared with results 
from MRST and CTEQ; Right: the parton distributions with their experimental and model uncertainties as determined by HI 
at the starting scale Q% — 4 GeV^ . 

3.6 Determination of the Light Quark Momentum Distributions at Low x at HERA 

Based on the data taken in the first phase of HERA's operation (1993-2000), the HERA collider exper- 
iments have measured a complete set of neutral (NC) and charged (CC) current double differential e^p 
inclusive scattering cross sections, based on about 120 pb~^ of positron-proton and 15pb~^ of electron- 
proton data. The NC and CC deep inelastic scattering (DIS) cross sections for unpolarised e^p scattering 
are determined by structure functions and quark momentum distributions in the proton as follows: 

4^~y+F2Tl^-xF3, (11) 

F2c^elx{U + U) + elx{D + D), (12) 

xF3 ~ 2x[aueu{U -U) + aded{D - D)], (13) 

a+c--xU + {l-yfxD, (14) 

a^c ~ xC/ + (1 - yfxD. (15) 

Here y = Q'^/sx is the inelasticity, s = AE^-Ep and 1^ = 1 it (1 — y)^. The parton distribution 
U = u + c + bis the sum of the momentum distributions of the up-type quarks with charge e„ = 2/3 
and axial vector coupling = 1/2, while D = d + s is the sum of the momentum distributions of the 
down type quarks with charge = —l/3,ad = —1/2. Similar relationships hold for the anti-quark 
distributions U and D. 

As is illustrated in Fig. 44 the HI experiment [46] has determined all four quark distributions 
and the gluon distribution xg. The accuracy achieved so far by HI, for x = 0.01,0.4 and 0.65, is 
1%, 3%, 7% for the sum of up quark distributions and 2%, 10%, 30% for the sum of down quark dis- 
tributions, respectively. The extracted parton distributions are in reasonable agreement with the results 
obtained in global fits by the MRST [19] and CTEQ [18] collaborations. The HI result is also consistent 
with the pdfs determined by the ZEUS Collaboration [45] which uses jet data to improve the accuracy for 
the gluon distribution and imports a. d — u asymmetry fit result from MRST. New data which are being 
taken (HERA II) will improve the accuracy of these determinations further. At the time this is written, 
the available data per experiment have been grown to roughly 150 pb~^ for both e'^p and e^p scattering, 

^"Contributing authors: M. Klein, B. Reisert 
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Fig. 45: Parton distributions and their uncertainties as determined by HI extrapolated to the region of the LHC, for x = 0.001 
near to the rapidity plateau. Top left: u valence; top right: d valence; bottom left: u and below c; bottom right, in decreasing 
order: d, s, b. The results are compared with recent fits to global data by MRST and CTEQ. Note that at such small x the 
valence quark distributions are very small. With increasing the relative importance of the heavy quarks compared to the 
light quarks increases while the absolute difference of the quark distributions is observed to be rather independent of Q^. The 
beauty contribution to the cross section thus becomes sizeable, amounting to about 5% for pp HW. 

and more is still to come. These data will be particularly important to improve the accuracy at large x, 
which at HERA is related to high Q^. 

As is clear from the above equations, the NC and CC cross section data are sensitive directly to 
only these four quark distribution combinations. Disentangling the individual quark flavours (up, down, 
strange, charm and beauty) requires additional assumptions. While informations on the c and b densities 
are being obtained from measurements of F^*^ and FI** of improving accuracy, the determination of the 
strange quark density at HERA is less straightforward and may rest on sW~^ — > c and strange ($) particle 
production [70]. The relative contributions from the heavy quarks become increasingly important with 
Q^, as is illustrated in Fig. 45. 

The larger x domain is dominated by the valence quarks. At HERA the valence quark distributions 
are not directly determined but extracted from the differences u^, = U — U and = D — D. Note that 
this implies the assumption that sea and anti-quarks are equal which in non-perturbative QCD models 
may not hold. A perhaps more striking assumption is inherent in these fits and regards the sea quark 
asymmetries at low x which is the main subject of the subsequent discussion. 

Fig. 46 shows the difference xd — xu as determined in the HI PDF 2000 fit based on the HI data 
alone (left) and using in addition the BCDMS proton and deuteron data (right). One observes a trend of 
these fits to reproduce the asymmetry near x ~ 0.1 which in the MRST and CTEQ fits, shown in Fig. 46, 
is due to fitting the Drell Yan data from the E866/NuSea experiment [71]. While this enhancement is not 
very stable in the HI fit [72] and not significant either, with the BCDMS data an asymmetry is observed 
which reflects the violation of the Gottfried sum rule. 

In the HI fit [46] the parton distributions at the initial scale = 4GeV^ are parameterised as 
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Fig. 46: Determination of the difference x(d-u) in the HI PDF 2000 fit to the HI data (left) and the HI and the BCDMS fip 
and fiD data (right). The sea quark difference is enforced to tend to zero at low x. The global fit results of MRST and CTEQ 
include Drell Yan data which suggest a sea quark asymmetry at a:: ~ 0.1. 

xP = ApX^'' (1 — x)*^^ • fp{x). The function fp is a polynomial in x which is determined by requiring 
"X^ saturation" of the fits, i.e. starting from fp = l additional terms Dpx, Epx"^ etc. are added and only 
considered if they cause a significant improvement in x^. half integer powers were considered in [72]. 
The result for fitting the HI data has been as follows: fg = {l + Dgx), fu = {1 + Dux + Fux^), 
= (1 + Dux) and fjj = fjj = 1. The parton distributions at low x are thus pai^ameterised as 
xP — > Apx^P . The strange (charm) anti-quark distribution is coupled to the total amount of down (up) 
anti-quarks as s = fcD (c = fcU). Two assumptions have been made on the behaviour of the quark 
and anti-quark distributions at low x. It has been assumed that quark and anti-quark distributions are 
equal and, moreover, that the sea is flavour symmetric. This implies that the slopes B of all four quark 
distributions are set equal Bu = Bjj = Bjj = Bjy. Moreover, the nomalisations of up and down quarks 
are the same, i.e. Ajj{l — fc) = A-j^{\ — fs), which ensures that d/u — > 1 as x tends to zero. The 
consequence of this assumption is illustrated in Fig. 46. While the DIS data suggest some asymmetry at 
larger x, the up-down quark asymmetry is enforced to vanish at lower x. This results in a rather fake 
high accuracy in the determination of the four quark distributions at low x, despite the fact that at low 
X there is only one combination of them measured, which is F2 = x[4{U + U) + {D + D)]/9. If one 
relaxes both the conditions on the slopes and normalisations, the fit to the HI data decides to completely 
remove the down quark contributions as is seen in Fig. 47 (left plot). 

In DIS the up and down quark asymmetry can be constrained using deuteron data because the 
nucleon structure function determines a different linear combination according to = 5x{U + U + 
D + D)/18 + x{c + c — s — s)/6 with = {p + n)/2. Unfortunately, there are only data at rather large x 
available. The effect of including the BCDMS data on the low x behaviour of the parton distributions is 
illustrated in Fig. 47 (right plot). It restores some amount of down quarks at low x , the errors, however, in 
particular of the down quarks, are still very large. The result is a large sea quark asymmetry uncertainty, 
which is shown in Fig. 48. At HERA a proposal had been made [73] to operate the machine in electron- 
deuteron mode. Measuring the behaviour at low x would not require high luminosity. Such data would 
constrain a possible sea quark asymmetry with very high accuracy, as is also shown in Fig. 48. 

Deuterons at HERA would require a new source and modest modifications to the preaccelerators. 
The HI apparatus could be used in its standard mode with a forward proton detector added to take 

Constraints on the sea quark distributions may also be obtained from /W~ production at the TeVatron. However, the 
sensitivity is limited to larger x > 0.1 [74] since W's produced in collisions involving sea quarks of smaller x will be boosted 
so strongly, that their decay products are not within the acceptance of the collider detectors. and production at the 
LHC has been discussed in [75]. 
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Fig. 47: Determinations of thie quark and gluon momentum distributions releasing tiie constraint xd = xu at low x, from the 
HI NC and CC data alone (left) and from the Hi ep and the BCDMS ^ip and /iD data (right). Since at low x < 0.01 there is 
no further constraint than that given from F2 the uncertainties of U and in particular of D become sizeable. 

data at half the beam energy. Tagging the spectator protons with high accuracy at HERA, for the first 
time in DIS, one could reconstruct the electron-neutron scattering kinematics essentially free of nuclear 
corrections [73]. Since the forward scattering amplitude is related to diffraction one would also be 
able to constrain shadowing to the per cent level [76]. The low x measurements would require small 
luminosity amounts, of less than 50pb~^. Long awaited constraints of the d/u ratio at large x and 
would require extended running, as would CC data. Besides determining the parton distributions 
better, the measurement of the singlet structure function would give important constraints on the 
evolution and theory at low x [77]. It would also result in an estimated factor of two improvement on the 
measurement of at HERA [78]. For the development of QCD, of low x physics in particular, but as 
well for understanding physics at the LHC and also for superhigh energy neutrino astrophysics, HERA 
eD data remain to be important. 
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Fig. 48: Simulation of the difference of sea quark distributions, here assumed to be zero, at low x based on additional 20pb~ 
of electron-deuteron data at HERA. The error band represents the uncertainty of the HI NLO QCD fit to the HI ep and the 
BCDMS fip and fid data without the constraint d — u at low x. The dashed curves represent calculations using recent global 
fits by MRST and by CTEQ. 
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Table 11: The data-sets included in the ZEUS- JETS and HERA-II projected PDF fits. The first column lists the type of data 
and the second gives the kinematic coverage. The third column gives the integrated luminosities of the HERA-I measurements 
included in the ZEUS-JETS fit. The fourth column gives the luminosities assumed in the HERA-II projection. Note that the 
96-97 NC and the 94-97 CC measurements have not had their luminosity scaled for the HERA-II projection. 



3.7 Impact of future HERA data on the determination of proton PDFs using the ZEUS NLO 
QCD fit 

3. 7. 1 PDF fits to HERA data 

Recently, the ZEUS Collaboration have performed a combined NLO QCD fit to inclusive neutral and 
charged current DIS data [53-58] as well as high precision jet data in DIS [79] and 7p scattering [80]. 
This is called the ZEUS -JETS PDF fit [45]. The use of only HERA data eliminates the uncertainties from 
heavy-target corrections and removes the need for isospin symmetry assumptions. It also avoids the dif- 
ficulties that can sometimes arise from combining data-sets from several different experiments, thereby 
allowing a rigorous statistical treatment of the PDF uncertainties. Furthermore, PDF uncertainties from 
current global fits ai^e, in general, limited by (iiTcducible) experimental systematics. In contrast, those 
from fits to HERA data alone, are largely limited by the statistical precision of existing measurements. 
Therefore, the impact of future data from HERA is likely to be most significant in fits to only HERA 
data. 

3. 7. 2 The ZEUS NLO QCD fit 

The ZEUS-JETS PDF fit has been used as the basis for all results shown in this contribution. The most 
important details of the fit are summarised here. A full description may be found elsewhere [45]. The 
fit includes the full set of ZEUS inclusive neutral and charged current e^p data from HERA-I (1994- 
2000), as well as two sets of high precision jet data in e+p DIS (Q^ >> 1 GeV^) and 7p (Q^ ~ 0) 
scattering. The inclusive data used in the fit, span the kinematic range 6.3 x 10^^ < x < 0.65 and 
2.7 < Q2 < 30000 GeV^. 

The PDFs are obtained by solving the NLO DGLAP equations within the MS scheme. These 
equations yield the PDFs at all values of provided they are input as functions of x at some starting 
scale Qq- The resulting PDFs are convoluted with coefficient functions to give predictions for structure 
functions and, hence, cross sections. In the ZEUS fit, the xuv{x) (li-valence), xdv{x) (d- valence), xS{x) 
(total sea-quark), xg{x) (gluon) and x{d{x) — u{x)) PDFs are parameterised at a starting scale of Qq = 7 
GeV^ by the form, 

xf(x) = pixP2(l - x)P^P{x), (16) 
''^Contributing authors: C. Gwenlan, A. Cooper- Sarkar, C. Targe tt- Adams. 
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Fig. 49: The optimised jet cross sections included in tlie HERA-II projected fit. The solid points show the simulated data 
generated using the NLO QCD programme of Frixione-Ridolfi, using the CTEQ5M1 proton and the AFG photon PDFs. The 
error bars show the statistical uncertainties, which correspond to 500 pb~^ of HERA data. Systematic uncertainties have been 
neglected. The dashed line shows the NLO QCD prediction using the ZEUS-S proton and AFG photon PDFs. The shaded band 
shows the contribution to the cross section uncertainty arising from the uncertainty in the gluon distribution in the proton. 
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Fig. 50: The fractional PDF uncertainties, as a function of x, for the u-valence, d-valence, sea-quark and gluon distributions at 
= 1000 GeV^. The red shaded bands show the results of the ZEUS-JETS fit and the yellow shaded bands show the results 
of the HERA-II projected fit. 



where P{x) = (1 + p^x). No advantage in the results from using more complex polynomial forms. 
The normalisation parameters, pi(My) and pi{d.u), are constrained by quark number sum rules while 
pi (g) is constrained by the momentum sum rule. Since there is no information to constrain any difference 
in the low-x behaviour of the u- and c?- valence quarks, P2{uy) has been set equal to P2{dv). The data 
from HERA are currently less precise than the fixed target data in the high-.x regime. Therefore, the high- 
X sea and gluon distributions are not well constrained in current fits to HERA data alone. To account 
for this, the sea shape has been restricted by setting P4{S) = 0. The high-x gluon shape is constrained 
by the inclusion of HERA jet data. In fits to only HERA data, there is no information on the shape of 
d — u. Therefore, this distribution has its shape fixed consistent with Drell-Yan data and its normalisation 
set consistent with the size of the Gottfried sum rule violation. A suppression of the strange sea with 
respect to the non-strange sea of a factor of 2 at Qq is also imposed, consistent with neutrino induced 
dimuon data from CCFR. The value of the strong coupling has been fixed to as{Mz) = 0.1180. After all 
constraints, the ZEUS-JETS fit has 1 1 free parameters. Heavy quarks were treated in the variable flavour 
number scheme of Thorne & Roberts [49]. Full account was taken of coiTclated experimental systematic 
uncertainties, using the Offset Method [44,48]. 

The results of two separate studies are presented. The first study provides an estimate of how 
well the PDF uncertainties may be known by the end of HERA-II, within the currently planned running 
scenario, while the second study investigates the impact of a future HERA measurement of Fl on the 
gluon distribution. All results presented, are based on the recent ZEUS -JETS PDF analysis [45]. 



3.7.3 PDF uncertainty estimates for the end of HERA running 

The data from HERA-I are already very precise and cover a wide kinematic region. However, HERA-II is 
now running efficiently and is expected to provide a substantial increase in luminosity. Current estimates 
suggest that, by the end of HERA running (in mid-2007), an integrated luminosity of 700 pb~^ should 
be achievable. This will allow more precise measurements of cross sections that are curently statistically 
hmited: in particular, the high-Q^ NC and CC data, as well as high-Q^ and/or high-E'x jet data. In 
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Fig. 51: NLO QCD inclusive jet cross section predictions at ^/s=l4 TeV in three regions of pseudo-rapidity. The yellow and 
blue bands show the PDF uncertainties from the ZEUS-JETS and HERA-II projected fits, respectively. 



addition to the simple increase in luminosity, recent studies [81] have shown that future jet cross section 
measurements, in kinematic regions optimised for sensitivity to PDFs, should have a significant impact 
on the gluon uncertainties. In this contribution, the effect on the PDF uncertainties, of both the higher 
precision expected from HERA-II and the possibility of optimised jet cross section measurements, has 
been estimated in a new QCD fit. This fit will be referred to as the "HERA-II projection". 

In the HERA-II projected fit, the statistical uncertainties on the currently available HERA-I data 
have been reduced. For the iiigh-Q^ inclusive data, a total integrated luminosity of 700 pb^^ was as- 
sumed, equally divided between e+ and e^. For the jet data, an integrated luminosity of 500 pb~^ was 
assumed. The central values and systematic uncertainties were taken from the published data in each 
case. In addition to the assumed increase in precision of the measurements, a set of optimised jet cross 
sections were also included, for forward dijets in 7p collisions, as defined in a recent study [81]. Since 
no real data are yet available, simulated points were generated using the NLO QCD program of Frixione- 
Ridom [82], using the CTEQ5M1 [18] proton and AFG [83] photon PDFs. The statistical uncertainties 
were taken to correspond to 500 pb~^. For this study, systematic uncertainties on the optimised jet cross 
sections were ignored. The simulated optimised jet cross section points, compared to the predictions of 
NLO QCD using the ZEUS-S proton PDF [44], are shown in Fig. 49. 

Table 1 1 lists the data-sets included in the ZEUS-JETS and HERA-II projected fits. The lumi- 
nosities of the (real) HERA-I measurements and those assumed for the HERA-II projection are also 
given. 

The results are summarised in Fig. 50, which shows the fractional PDF uncertainties, for the u- 
and d-valence, sea-quark and gluon distributions, at = 1000 GeV^. The yellow bands show the 
results of the ZEUS-JETS fit while the red bands show those for the HERA-II projection. Note that the 
same general features are observed for all values of Q^. In fits to only HERA data, the information on the 
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valence quarks comes from the high-Q^ NC and CC cross sections. The increased statistical precision 
of the high-Q^ data, as assumed in the HERA-II projected fit, gives a significant improvement in the 
valence uncertainties over the whole range of x. For the sea quarks, a significant improvement in the 
uncertainties at high-x is also observed. In contrast, the low-x uncertainties are not visibly reduced. This 
is due to the fact that the data constraining the low-x region tends to be at lower-Q^, which are already 
systematically limited. This is also the reason why the low-x gluon uncertainties are not significantly 
reduced. However, the mid-to-high-x gluon, which is constrained by the jet data, is much improved in 
the HERA-II projected fit. Note that about half of the observed reduction in the gluon uncertainties is 
due to the inclusion of the simulated optimised jet cross sections. 

Inclusive jet cross sections at the LHC The improvement to the high-x partons, observed in the 
HERA-II projection compared to the ZEUS-JETS fit, will be particularly relevant for high-scale physics 
at the LHC. This is illustrated in Fig. 51, which shows NLO QCD predictions from the JETRAD [84] 
programme for inclusive jet production at a/s = 14 TeV. The results are shown for both the ZEUS-JETS 
and the HERA-II projected PDFs. The uncertainties on the cross sections, resulting from the PDFs, have 
been calculated using the LHAPDF interface [85]. For the ZEUS-JETS PDF, the uncertainty reaches 
^ 50% at central pseudo-rapidities, for the highest jet transverse energies shown. The prediction using 
the HERA-II projected PDF shows a maifed improvement at high jet ti'anverse energy. 

3. 7.4 Impact of a future HERA measurement of Fi on the gluon PDF 

The longitudinal structure function, F^, is directly related to the gluon density in the proton. In principle, 
Fl can be extracted by measuring the NC DIS cross section at fixed x and Q^, for different values of y 
(see Eqn. 4). A precision measurement could be achieved by varying the centre-of-mass energy, since 
s = Q"^ /xy K, AEeEp, where E^, and Ep are the electron and proton beam energies, respectively. 
Studies [67] (Sec. 3.5) have shown that this would be most efficiently achieved by changing the proton 
beam energy. However, such a measurement has not yet been performed at HERA. 

There are several reasons why a measurement of Fl at low-x could be important. The gluon 
density is not well known at low-x and so different PDF parameterisations can give quite different pre- 
dictions for Fl at low-x. Therefore, a precise measurement of the longitudinal sturcture function could 
both pin down the gluon PDF and reduce its uncertainties. Furthermore, predictions of Fl also depend 
upon the nature of the underlying theory (e.g. order in QCD, resummed calculation etc). Therefore, a 
measurement of Fl could also help to discriminate between different theoretical models. 

Impact on the gluon PDF uncertainties The impact of a possible future HERA measurement of Fl 
on the gluon PDF uncertainties has been investigated, using a set of simulated Fl data-points [67]. (see 
Sec. 3.5). The simulation was performed using the GRV94 [86] proton PDF for the central values, and 
assuming Ee = 27.6 GeV and Ep = 920, 575, 465 and 400 GeV, with luminosities of 10, 5, 3 and 2 
pb^^, respectively. Assuming that the luminosity scales simply as Ep, this scenario would nominally 
cost 35 pb~^ of luminosity under standard HERA conditions. However, this estimate takes no account 
of time taken for optimisation of the machine with each change in Ep, which could be considerable. The 
systematic uncertainties on the simulated data-points were calculated assuming a ~ 2% precision on the 
inclusive NC cross section measurement. A more comprehensive description of the simulated data is 
given in contribution for this proceedings, see Sec. 3.5. 

The simulated data were included in the ZEUS-JETS fit. Figure 52 shows the gluon distribution 
and fractional uncertainties for fits with and without inclusion of the simulated Fl data. The results 
indicate that the gluon uncertainties are reduced at low-x, but the improvement is only significant at 
relatively low < 20 GeV^. 
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Fig. 52: The gluon PDFs, showing also the fractional uncertainty, for fits with and without inclusion of the simulated Fl data, 
for = 1.5, 5, 10 and 20 GeV^. The red shaded bands show the results of the ZEUS-JETS fit and the yellow shaded band 
show the results of the ZEUS-JETS+Fz, fit. 





PDF 


QCD order of coefficient functions 


Maximum Fl 


MRSG95 


NNLO 


Middle Fl 


GRV94 


NLO 


Minimum Fl 


MRST2003 


NLO 



Table 12: Summary of the PDFs used to generate the simulated Fl data-points. The extreme maximum Fl points were 
generated using the MRSG95 PDF, and convoluted with NNLO coefficient functions. The middle points were generated using 
the GRV94 PDF, and the extreme minimum points were generated using the MRST2003 PDF, which has a negative gluon at 
low-s. 

Discrimination between theoretical models In order to assess whether a HERA measurement of 
Fl could discriminate between theoretical models, two more sets of Fl data-points have been simu- 
lated [87], using different theoretical assumptions. The first of the two sets was generated using the 
MRSG95 [88] proton PDF, which has a large gluon density. The PDFs were then convoluted with the 
NNLO order coefficient functions, which are large and positive. This gives the "maximum" set of Fl 
data-points. In contrast, the second set has been generated using the MRST2003 [89] proton PDF, which 
has a negative gluon at low-x and low-Q^, thus providing a "minimum" set of Fl data. The original 
set of Fl points described in the previous subsection lies between these two extremes. The details of all 
three sets are summarised in Tab. 12. 

Figure 53 shows the results of including, individually, each set of simulated Fl data into the ZEUS 
NLO QCD fit. The results show that the NLO fit is relatively stable to the inclusion of the extreme sets 
of data. This indicates that a measurement of Fl could discriminate between certain theoretical models. 
However, it should be noted that the maximum and minimum models studied here were chosen specifi- 
cally to give the widest possible variation in Fl- There are many other alternatives that would lie between 
these extremes and the ability of an Fl measurement to discriminate between them would depend both 
on the experimental precision of the measurement itself, as well as the theoretical uncertainties on the 
models being tested. 
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Fig. 53: The distribution of the longitudinal structure function Fl at Q =5, 10 and 20 GeV . The blue, red and green points 
show the simulated Fl data-points, respectively labelled maximum, middle and minimum in Table 12. The blue, red and green 
shaded bands show the NLO QCD predictions, in the case where the data-points of the conesponding colour have been included 
in the fit. For comparison, the yellow shaded band shows the prediction of the ZEUS-JETS fit. 
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3.8 A Method to Include Final State Cross-sections Measured in Proton-Proton Collisions to 
Global NLO QCD Analysis 

The Large Hadron ColUder (LHC), currently under construction at CERN, will collide protons on pro- 
tons with an energy of 7 TeV. Together with its high collision rate the high available centre-of-mass 
energy will make it possible to test new interactions at very short distances that might be revealed in the 
production cross-sections of Standard Model (SM) particles at very high transverse momentum (Pt) as 
deviation from the SM theory. 

The sensitivity to new physics crucially depends on experimental uncertainties in the measure- 
ments and on theoretical uncertainties in the SM predictions. It is therefore important to work out a 
strategy to minimize both the experimental and theoretical uncertainties from LHC data. For instance, 
one could use single inclusive jet or Drell-Yan cross-sections at low Pt to constrain the PDF uncertain- 
ties at high Pt- Typical residual renormalisation and factorisation scale uncertainties in next-to-leading 
order (NLO) calculations for single inclusive jet-cross-section ai^e about 5 — 10% and should hopefully 
be reduced as NNLO calculations become available. The impact of PDF uncertainties on the other hand 
can be substantially larger in some regions, especially at large Pt, and for example at Pt = 2000 GeV 
dominate the overall uncertainty of 20%. If a suitable combination of data measured at the Tevatron and 
LHC can be included in global NLO QCD analyses, the PDF uncertainties can be constrained. 

The aim of this contribution is to propose a method for consistently including final-state observ- 
ables in global QCD analyses. 

For inclusive data like the proton structure function F2 in deep-inelastic scattering (DIS) the per- 
turbative coefficients are known analytically. During the fit the cross-section can therefore be quickly 
calculated from the strong coupling (as) and the PDFs and can be compared to the measurements. How- 
ever, final state observables, where detector acceptances or jet algorithms are involved in the definition of 
the perturbative coefficients (called "weights" in the following), have to be calculated using NLO Monte 
Carlo programs. Typically such programs need about one day of CPU time to calculate accurately the 
cross-section. It is therefore necessary to find a way to calculate the perturbative coefficients with high 
precision in a long run and to include Og and the PDFs "a posteriori". 

To solve this problem many methods have been proposed in the past [41,45,90-93]. In principle 
the highest efficiencies can be obtained by taking moments with respect to Bjorken-x [90,91], because 
this converts convolutions into multiplications. This can have notable advantages with respect to memory 
consumption, especially in cases with two incoming hadrons. On the other hand, there are complications 
such as the need for PDFs in moment space and the associated inverse Mellin transforms. 

Methods in x-space have traditionally been somewhat less efficient, both in terms of speed (in 
the 'a posteriori' steps — not a major issue here) and in terms of memory consumption. They are, 
however, somewhat more transparent since they provide direct information on the x values of relevance. 
Furthermore they can be used with any PDF. The use of x-space methods can be further improved by 
using methods developed originally for PDF evolution [94,95]. 

3.8.1 PDF-independent representation of cross-sections 

Representing the PDF on a grid We make the assumption that PDFs can be accurately represented 
by storing their values on a two-dimensional grid of points and using n*^-order interpolations between 
those points. Instead of using the parton momentum fraction x and the factorisation scale Q^, we use a 
variable transformation that provides good coverage of the full x and range with uniformly spaced 

Contributing authors: T. Carli, G. Salam, F. Siegert. 
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grid points:^^ 

1 

y(x) = ln- and r(Q2)=lnln^. (17) 

The parameter A is to be chosen of the order of Aqcd> but not necessarily identical. The PDF q{x,Q'^) 
is then represented by its values qiy^i^ at the 2-dimensional grid point {iy 6y,ir St), where 6y and 6t 
denote the grid spacings, and obtained elsewhere by interpolation: 

i=o L=o \ y / \ / 

where n, n' are the interpolation orders. The interpolation function (u) is 1 for n = i and otherwise 
is given by: 

4->(„) = (-')'■; "'"-"-'"-"^ (19) 

i\{n — I)] u — I 

Defining int(u) to be the largest integer such that int(u) < u, k and k are defined as: 

k{x)= int(fl-n^), K{x) = int - (20) 

Given finite grids whose vertex indices range from . . . A'^y — 1 for the y grid and . . . A'^,- — 1 for the r 
grid, one should additionally require that eq. (18) only uses available grid points. This can be achieved 
by remapping k max(0, min(A^j^ — 1 — n, k)) and k max(0, min(A'^^ — 1 — n', «;)). 

Representing the final state cross-section weights on a grid (DIS case) Suppose that we have an 
NLO Monte Carlo program that produces events m = 1 . . . A^. Each event m has an x value, Xm, a 
value, Q^, as well as a weight, Wm, and a con^esponding order in as, Pm- Normally one would obtain 
the final result W of the Monte Carlo integration from:^^ 

W=Y.^rn (^^Y^ q{Xm,Ql). (21) 
m=l V / 

Instead one introduces a weight grid \^ and then for each event updates a portion of the grid 

with: 

z = . . . n, i = . . . n' : 

wtt^, - <;r,U. + - 4"'^ - -) , (22) 

where k = k{xm), k = k{Q'^). 



The final result for W, for an arbitrary PDF, can then be obtained subsequent to the Monte Carlo run: 

2tt 



»' = E E E ,U-\Q'"'') , (23) 



where the sums index with iy and i^- run over the number of grid points and we have have explicitly 
introduced x^^y^ and Q^'"*^'' such that: 

y{x^^v)) = iy Sy and r (q^^'^^) = v St. (24) 



An alternative for the x grid is to use y = \nl/x + a{l — x) with a a parameter that serves to increase the density of points 
in the large x region. 

""Here, and in the following, renormalisation and factorisation scales have been set equal for simplicity. 
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Including renormalisation and factorisation scale dependence If one has the weight matrix W^^\^ 
determined separately order by order in Us, it is straightforward to vary the renormalisation /ijj and 
factorisation fip scales a posteriori (we assume that they were kept equal in the original calculation). 

It is helpful to introduce some notation relating to the DGLAP evolution equation: 



(ilnQ2 27r ' ^/v '-^ / 27r 

where the Pq and Pi are the LO and NLO matrices of DGLAP splitting functions that operate on vectors 
(in flavour space) q of PDFs. Let us now restrict our attention to the NLO case where we have just two 
values of p, pLO and pnlo- Introducing and corresponding to the factors by which one varies ^ur 
and fip respectively, for arbitrary ^ji and ^^t^ we may then write: 




\neFWi^^^f{P,®q)(x^'y\eFQ 



where /3o = {\lNc — 2nf) / (127r) and Nc (nj) is the number of colours (flavours). Though this formula is 
given for x-space based approach, a similar formula applies for moment-space approaches. Furthermore 
it is straightforward to extend it to higher perturbative orders. 



Representing the weights in the case of two incoming hadrons In hadron-hadron scattering one 
can use analogous procedures with one more dimension. Besides Q^, the weight grid depends on the 
momentum fraction of the first {xi) and second (^2) hadron. 

In the case of jet production in proton-proton collisions the weights generated by the Monte Carlo 
program as well as the PDFs can be organised in seven possible initial state combinations of paitons: 



gg: F^'^\xi,X2;Q^) = Gi(xi)G2(x2) (27) 

qg: fW(xi,X2;Q') = {Qi{xi) + Q^{xi)) G2{x2) (28) 

gq: F^^\xi,X2;Q^) = Gi{xi) {Q2{x2) + Q2{x2)) (29) 

qr: F(3)(xi, xs; Q^) = Qi{xi)Q2{x2) + Qi[xi)Q2{x2) - D{xi,X2) (30) 

qq: F^'^\xi,X2]Q^) = D{xi,X2) (31) 

qq: F^^\xi, X2; Q^) = D{xi,X2) (32) 

qr: F'-^^xi, X2;Q^) = Qi{xi)Q2{x2) + Qi{xi)Q2{x2) -D{xi, X2), (33) 



where g denotes gluons, q quarks and r quarks of different flavour q ^ r and we have used the generalized 



77 



PDFs defined as: 

6 -1 
Gh{x) = fo/H{x,Q^), Qh{x) = ^fi/H{x,Q^), Qh{x) = ^ fi/H{x,Q^), 



i=l j=— 6 

6 

Dixi,X2) = fi/Hli^^^Q^)fi/H,{x2,Q^), (34) 

i=-6 

6 

D{xi,X2,flF) = ^ fi/HAxi,Q^)f_iiH^{x2,Q'^), 
i=-6 

wliere fijH is the PDF of flavour i = —6 ... 6 for hadron H and Hi {H2) denotes the first or second 
hadron'*^ . 



The analogue of eq. 23 is then given by: 



6 



2(v) 



P 



Including scale depedence in the case of two incoming hadrons It is again possible to choose arbi- 
trary renormalisation and factorisation scales, specifically for NLO accuracy: 



27r 



where Fg^^p^^i^g^ is calculated as F^'^ but with qi replaced wtih Po®9i> and analogously for Pq^^Pg(^q2- 
3.8.2 Technical implementation 

To test the scheme discussed above we use the NLO Monte Carlo program NLOJET++ [96] and the 
CTEQ6 PDFs [18]. The grid W^''^^-^ of eq. 35 is filled in a NLOJET++ user module. This module 
has access to the event weight and parton momenta and it is here that one specifies and calculates the 
physical observables that are being studied (e.g. jet algorithm). 

Having filled the grid we construct the cross-section in a small standalone program which reads 
the weights from the grid and multiplies them with an arbitrary as and PDF according to eq. 35. This 
program runs very fast (in the order of seconds) and can be called in a PDF fit. 

The connection between these two programs is accomplished via a C-i~i- class, which provides 
methods e.g. for creating and optimising the grid, filling weight events and saving it to disk. The classes 
are general enough to be extendable for the use with other NLO calculations. 

The complete code for the NLOJET-i~i- module, the C-i~i- class and the standalone job is available 
from the authors. It is still in a development, testing and tuning stage, but help and more ideas are 
welcome. 



''^In the above equation we follow the standard PDG Monte Carlo numbering scheme [47] where gluons are denoted as 0, 
quarks have values from 1-6 and anti-quarks have the corresponding negative values. 
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The C++ class The main data members of this class are the grids implemented as aiTays of three- 
dimensional ROOT histograms, with each grid point at the bin centers^^: 

TH3D[p][l][iobs](xi,X2,Q2), (37) 

where the / and p are explained in eq. 35 and iobs denotes the observable bin, e.g. a given Pt range^°. 

The C++ class initialises, stores and fills the grid using the following main methods: 

• Default constructor: Given the pre-defined kinematic regions of interest, it initializes the grid. 

• Optimizing method: Since in some bins the weights will be zero over a large kinematic region in 
xi,X2,Q'^, the optimising method implements an automated procedure to adapt the grid bound- 
aries for each observable bin. These boundaries are calculated in a first (short) run. In the present 
implementation, the optimised grid has a fixed number of grid points. Other choices, like a fixed 
grid spacing, might be implemented in the future. 

• Loading method: Reads the saved weight grid from a ROOT file 

• Saving method: Saves the complete grid to a ROOT file, which will be automatically compressed. 

The user module for NLOJET++ The user module has to be adapted specifically to the exact defi- 
nition of the cross-section calculation. If a grid file already exists in the directory where NLOJET++ is 
started, the grid is not started with the default constructor, but with the optimizing method (see 3.8.2). In 
this way the grid boundaries are optimised for each observable bin. This is necessary to get very fine grid 
spacings without exceeding the computer memory. The grid is filled at the same place where the standard 
NLOJET++ histograms are filled. After a certain number of events, the grid is saved in a root-file and 
the calculation is continued. 



The standalone program for constructing the cross-section The standalone program calculates the 
cross-section in the following way: 

1 . Load the weight grid from the ROOT file 

2. Initialize the PDF interface^^ load q{x, Q^) on a helper PDF-grid (to increase the performance) 

3. For each observable bin, loop over iy-^ , iy^, ir,hp and calculate X2,Q'^) from the appropri- 
ate PDFs q{x,Q'^), multiply ctg and the weights from the grid and sum over the initial state parton 
configuration I, according to eq. 35. 

3.8.3 Results 

We calculate the single inclusive jet cross-section as a function of the jet transverse momentum {Pt) 
for jets within a rapidity of \y\ < 0.5. To define the jets we use the seedless cone jet algorithm as im- 
plemented in NLOJET++ using the four-vector recombination scheme and the midpoint algorithm. The 
cone radius has been put to R = 0.7, the overlap fraction was set to / = 0.5. We set the renormalisation 
and factorization scale to = P^max^ where PT,max is the Pt of the highest Pt jet in the required 
rapidity region^^. 

'*'R00T histograms are easy to implement, to represent and to manipulate. They are therefore ideal in an early development 
phase. An additional advantage is the automatic file compression to save space. The overhead of storing some empty bins 
is largely reduced by optimizing the Xi, X2 and grid boundaries using the NLOJET++ program before final filling. To 
avoid this residual overhead and to exploit certain symmetries in the grid, a special data class (e.g. a sparse matrix) might be 
constructed in the future. 

'"For the moment we construct a grid for each initial state parton configuration. It will be easy to merge the qg and the gq 
initial state parton configurations in one grid. In addition, the weights for some of the initial state parton configurations are 
symmetric in xi and X2. This could be exploited in future applications to further reduce the grid size. 

"We use the C++ wrapper of the LHAPDF interface [85]. 

'"Note that beyond LO the PT,max will in general differ from the Pt of the other jets, so when binning an inclusive jet 
cross section, the Pt of a given jet may not correspond to the renormalisation scale chosen for the event as a whole. For this 
reason we shall need separate grid dimensions for the jet Pt and for the renormalisation scale. Only in certain moment-space 
approaches [91] has this requirement so far been efficiently circumvented. 
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In our test runs, to be independent from statistical fluctuations (which can be large in particular 
in the NLO case), we fill in addition to the grid a reference histogram in the standard way according to 
eq. 21. 

The choice of the grid architecture depends on the required accuracy, on the exact cross-section 
definition and on the available computer resources. Here, we will just sketch the influence of the grid 
architecture and the interpolation method on the final result. We will investigate an example where 
we calculate the inclusive jet cross-section in A'obs = 100 bins in the kinematic range 100 < -Py < 
5000 GeV. In future applications this can serve as guideline for a user to adapt the grid method to 
his/her specific problem. We believe that the code is transparent and flexible enough to adapt to many 
applications. 

As reference for comparisons of different grid architectures and interpolation methods we use the 
following: 

• Grid spacing in y{x): 10"'^ < xi, a;2 < 1.0 with Ny = 30 

• Grid spacing in t{Q^): 100 GeV <Q < 5000 GeV with Nr = 30 

• Order of interpolation: Uy = 3, rir = 3 

The grid boundaries correspond to the user setting for the first run which determines the grid boundaries 
for each observable bin. In the following we call this grid architecture 30^x30x100(3,3). Such a grid 
takes about 300 Mbyte of computer memory. The root-file where the grid is stored has about 50 Mbyte. 

The result is shown in Fig. 54a). The reference cross-section is reproduced everywhere to within 
0.05%. The typical precision is about 0.01%. At low and high Pt there is a positive bias of about 
0.04%. Also shown in Fig. 54a) are the results obtained with different grid architectures. For a finer 
X grid (50^x30x100(3, 3)) the accuracy is further improved (within 0.005%) and there is no bias. A 
finer (30^x60x100(3, 3)) as well as a coarser (30^x10x100(3, 3)) binning in does not improve the 
precision. 

Fig. 54b) and Fig. 54c) show for the grid (30^x30x100) different interpolation methods. With an 
interpolation of order 77, = 5 the precision is 0.01% and the bias at low and high Pt observed for the 
n = 3 interpolation disappears. The result is similar to the one obtained with finer a; -points. Thus by 
increasing the interpolation order the grid can be kept smaller. An order n = 1 interpolation gives a 
systematic negative bias of about 1% becoming even larger towards high Pt- 

Depending on the available computer resources and the specific problem, the user will have to 
choose a proper grid architecture. In this context, it is interesting that a very small grid 10^x10x100(5, 5) 
that takes only about 10 Mbyte computer memory reaches still a precision of 0.5%, if an interpolation of 
order n = 5 is used (see Fig. 54d)). 

We have developed a technique to store the perturbative coefficients calculated by an NLO Monte 
Carlo program on a grid allowing for a-posteriori inclusion of an arbitrary parton density function (PDF) 
set. We extended a technique that was already successfully used to analyse HERA data to the more 
demanding case of proton-proton collisions at LHC energies. 

The technique can be used to constrain PDF uncertainties, e.g. at high momentum transfers, from 
data that will be measured at LHC and allows the consistent inclusion of final state observables in global 
QCD analyses. This will help increase the sensitivity of LHC to find new physics as deviations from the 
Standard Model predictions. 

Even for the large kinematic range for the parton momentum fractions xi and X2 and of the squared 
momentum transfer accessible at LHC, grids of moderate size seem to be sufficient. The single 
inclusive jet cross-section in the central region \y\ < 0.5 can be calculated with a precision of 0.01% 
in a realistic example with 100 bins in the transverse jet energy range 100 < Pt < 5000 GeV. In this 
example, the grid occupies about 300 Mbyte computer memory. With smaller grids of order 10 Mbyte 
the reachable accuracy is still 0.5%. This is probably sufficient for all practical applications. 
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Fig. 54: Ratio between the single inclusive jet cross-section with 100 Pt bins calculated with the grid technique and the 
reference cross-section calculated in the standard way. Shown are the standard grid, grids with finer x and sampling (a) 
with interpolation of order 1, 3 and 5 (b) (and on a finer scale in c)) and a small grid (d). 
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4 GLAP evolution and parton fits 
4.1 Introduction 

The high-precision data from HERA and the anticipated data from LHC open the possibihty for a precise 
determination of parton distributions. This, however, requires an improvement in the theoretical descrip- 
tion of DIS and hard hadronic scattering processes, as well as an improvement of the techniques used to 
extract parton distributions from the data. 

The determination of perturbative QCD corrections has undergone substantial progress recently. 
The key ingredient of a complete next-to-next-to-leading order (NNLO) prediction in perturbative QCD 
are the recently calculated three-loop splitting functions which govern the scale dependence of PDFs. 
Extensions in the accuracy of the perturbative predictions yet beyond NNLO are given by the three-loop 
coefficient functions for F2, while the coefficient functions for Fl at this order are actually required to 
complete the NNLO predictions. Section 4.2 briefly discusses the recent results and their phenomenolog- 
ical impUcations. Certain mathematical aspects, which are important in the calculation of higher order 
corrections in massless QCD are presented in section 4.3. In particular, algebraic relations in Mellin- 
N space are pointed out, which are of importance for harmonic sums, harmonic poly logarithms and 
multiple (^-values. 

These calculation of the PDF evolution to NNLO in perturbative QCD are used in section 4.4 to 
provide an update and extension of a set of benchmark tables for the evolution of parton distributions of 
hadrons. These benchmark tables were first presented in the report of the QCD/SM working group at the 
2001 Les Houches workshop, but based on approximate NNLO splitting functions, which are superseded 
by the exact results which are now available. In addition, section 4.4 now includes also reference tables 
for the case of polarized PDF evolution. 

Whereas in principle the x-shapes of PDFs at low scales can be determined from first principles 
using non-perturbative methods, in practice at present this is only possible using models (briefly touched 
in in section 4.5). Therefore, an accurate determination of PDFs requires a global QCD fit to the data, 
which is the subject of sections 4.6^.8. 

Section 4.6 discusses in particular the impact on parton fits of NNLO corrections on the one hand, 
and of the inclusion of Drell-Yan data and future LHC data on the other hand. It then presents values 
for a benchmark fit together with a table of con^elation coefficients for the parameter obtained in the 
fit. This benchmark fit is then re-examined in sec. 4.7, along with a comparison between PDFs and the 
associated uncertainty obtained using the approaches of Alekhin and the MRST group. The differences 
between these benchmark partons and the actual global fit partons are also discussed, and used to explore 
complications inherent in extracting PDFs with uncertainties. Finally, in section 4.8 the stability of PDF 
determinations in NLO global analyses is re-investigated and the results of the CTEQ PDF group on this 
issue are summarized. 

An alternative approach to a completely bias-free parameterization of PDFs is presented in sec- 
tion 4.9. There, a neural network approach to global fits of parton distribution functions is introduced 
and work on unbiased parameterizations of deep-inelastic structure functions with faithful estimation of 
their uncertainties is reviewed together with a summary of the current status of neural network parton 
distribution fits. 



Subsection coordinators: A. Glazov, S. Moch 
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4.2 Precision Predictions for Deep-Inelastic Scattering 

With high-precision data from HERA and in view of the outstanding importance of hard scattering pro- 
cesses at the LHC, a quantitative understanding of deep-inelastic processes is indispensable, necessitating 
calculations beyond the standard next-to-leading order of perturbative QCD. 

In this contribution we review recent results for the complete next-to-next-to-leading order (NNLO, 
N^LO) approximation of massless perturbative QCD for the structure functions Fi, F2, and Fl in 
DIS. These are based on the second-order coefficient functions [97-101], the three-loop splitting func- 
tions which govern the evolution of unpolarized parton distributions of hadrons [102, 103] and the three- 
loop coefficient functions for Fl = F2 — 2xFi in electromagnetic (photon-exchange) DIS [62, 104]. 
Moreover we discuss partial N'^LO results for F2, based on the corresponding three-loop coefficient 
functions also presented in Ref. [104]. For the splitting functions P and coefficient functions C we 
employ the convention 

n=0 n=0 

for the expansion in the running coupling constant a^. For the longitudinal structure function Fl the 
third-order corrections are required to complete the NNLO predictions, since the leading contribution to 
the coefficient function Cl is of first order in the strong coupling constant Og. 

In the following we briefly display selected results to demonstrate the quality of precision pre- 
dictions for DIS and their effect on the evolution. The exact (analytical) results to third order for the 
quantities in Eq. (38) are too lengthy, about 0(100) pages in normalsize fonts and will not be repro- 
duced here. Also the method of calculation is well documented in the literature [101-106]. In particular-, 
it proceeds via the Mellin transforms of the functions of the Bjorken variable x, 

1 

A{N) = Jdxx^-^A{x) . (39) 


Selected mathematical aspects of Mellin transforms are discussed in section 4.3. 
4.2.1 Parton evolution 

The well-known 2nj — 1 scalar- non-singlet and 2x2 singlet evolution equations for n j flavors read 

qL = Ks^qL, i = ±,y, (40) 

for the quark flavor asymmetries q^^ and the valence distribution q^^, and 



dln/x| \ g J \ Pgq Pgg J \ 9 



(41) 



for the singlet quark distribution and the gluon distribution g, respectively. Eqs. (40) and (41) are 
governed by three independent types of non-singlet splitting functions, and by the 2x2 matrix of sin- 
glet splitting functions. Here ® stands for the Mellin convolution. We note that benchmark numerical 
solutions to NNLO accuracy of Eqs. (40) and (41) for a specific set of input distributions are given in sec- 
tion 4.4. Phenomenological QCD fits of parton distributions in data analyses are extensively discussed 
in sections 4.6-4.8. An approach based on neural networks is described in section 4.9. 

Let us start the illustration of the precision predictions by looking at the parton evolution and at 
large Mellin- (large Bjorken-x) behavior. Fig. 55 shows the stability of the perturbative expansion 

''^Contributing authors: S. Mochi, J.A.M. Vermaseren, A. Vogt 
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which is very benign and indicates, for Qs ^ 0.2, corrections of less than 1% beyond NNLO. On the left 
we show the results for the perturbative expansion of Pns in Mellin space, cf. Eqs. (38), (39). We employ 
four active flavors, nj = 4, and an order-independent value for the strong coupling constant, 

as(/io') = 0.2 , (42) 

which corresponds to ~ 25 ... 50 GeV^ for as(Mj) = 0.114 . . . 0.120 beyond the leading order. 
On the right of Fig. 55 the perturbative expansion of the logarithmic derivative, cf. Eqs. (38), (40), 
is illustrated at the standard choice /i^ = fij of the renormalization scale. We use the schematic, but 
characteristic model distribution, 

xqU^,fi^) = x0-5(l-x)3. (43) 
The normalization of q ns is irrelevant at this point, as we consider the logarithmic scale derivative only. 
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X X 



Fig. 56: The three-loop gluon-quark (left) and gluon-gluon (right) splitting functions together with the leading small-a:: contri- 
bution (dotted line). 

Next, let us focus on the thr^ee-loop splitting functions at small momentum fractions x, where the 
splitting functions Pjg in the lower row of the 2x2 matrix in Eq. (41), representing g i splittings, 

(2) 

are most important. In Fig. 56 we show, again for n j = 4, the three-loop splitting functions Pqg and 
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Pgg together with the leading small-x term indicated separately for x < 0.01. In the present singlet 
case the leading logarithmic small-x limits ~ Inx of Refs. [107, 108] are confirmed together with 
the general structure of the BFKL limit [29, 30, 109]. The same holds for the leading small-x terms In^ x 
in the non-singlet sector [110, 111], with the qualification that a new, unpredicted leading logarithmic 
contribution is found for the color factor d'^'^^dabc entering at three loops for the first time. 

It is obvious from Fig. 56 (see also Refs. [101-103, 106]) that the leading x — > 0-terms alone are 
insufficient for collider phenomenology at HERA or the LHC as they do not provide good approximations 
of the full results at experimentally relevant small values of x. Resummation of the small-x terms and 
various phenomenological improvements are discussed in detail in section 5. 
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Fig. 57: The perturbative expansion of the scale derivatives (41) of the singlet distributions (44). 

In the same limit of small x, it is instructive to look at the evolution of parton distributions. Again, 
we choose the reference scale of Eq. (42), nj = 4 and the sufficiently realistic model distributions 

xqsix,iJ,^) = 0.6x-°-'\l-xf-^{l + 5.0x'^-^) 

xg{x,n^) = 1.6x-°-^(l -x)^-^(l -0.6j;°-2) (44) 

irrespective of the order of the expansion to facilitate direct comparisons of the various contributions. 
Of course, this order-independence does not hold for actual data-fitted parton distributions like those in 
sections 4.6-4.8. In Fig. 57 we display the perturbative expansion of the scale derivative for the singlet 
quark and gluon densities at ;u| = /Uq for the initial conditions specified in Eqs. (42) and (44). For the 
singlet quark distribution the total NNLO corrections, while reaching 10% at x = 10 remain smaller 
than the NLO results by a factor of eight or more over the full x-range. For the gluon distribution already 
the NLO corrections are small and the NNLO contribution amounts to only 3% for x as low as 10 
Thus, we see in Fig. 57 that the perturbative expansion is very stable. It appears to converge rapidly at 
X > 10^'^, while relatively large third-order corrections are found for very small momenta x < 10~^. 

4.2.2 Coefficient functions 

While the previous considerations were addressing the evolution of parton distributions, we now turn to 
the further improvements of precision predictions due to the full third-order coefficient functions for the 
structure functions F2 and Fl in electromagnetic DIS [62, 104]. The results for Fl complete the NNLO 
description of unpolaiized electromagnetic DIS, and the third-order coefficient functions for F2 form, at 
not too small values of the Bjorken variable x, the dominant part of the next-to-next-to-next-to-leading 
order (N^LO) corrections. Thus, they facilitate improved determinations of the strong coupling from 
scaling violations. 
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Fig. 58: The three-loop non-singlet coefficient function C2^^{x) in the large-x (left) and the small-x (right) region, multiplied 
by (1 — a;) for display purposes. 



Let us start with the three-loop coefficient functions for F2 in the non-singlet case. In Fig. 58 we 

(3) 

display the three-loop non-singlet coefficient function C2^^{x) for n/- = 4 flavors. We also show the 
soft-gluon enhanced terms dominating the large-x limit, 



(45) 



(l-x)+ ' 

and the small-x approximations obtained by successively including enhanced logarithms In^ x. How- 
ever the latter are insufficient for an accurate description of the exact result. The dashed band in Fig. 58 
shows the uncertainty of previous estimates [112] mainly based on the calculation of fixed Mellin mo- 
ments [113-115]. For a detailed discussion of the soft-gluon resummation of the the terms, we refer 
to section 5. 
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Fig. 59: The perturbative expansion of the logarithmic scale derivative of the non-singlet structure function f2,ns. The results 
up to NNLO are exact, while those at N'^LO are very good approximations. The N^'LO corrections have been estimated by 
various methods. 

Building on the coefficient functions, it is interesting to study the perturbative expansion of the 
logarithmic scale derivative for the non-singlet structure function F2^ns- To that end we use in Fig. 59 
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again the input shape Eq. (43) (this time for -F2,ns itself) iiTespective of the order of the expansion, 
Uf = A flavors and the reference scale of Eq. (42). The N^LO approximation based on Pade summations 
of the perturbation series can be expected to correctly indicate at least the rough size of the four-loop 
corrections, see Ref. [104] for details. From Fig. 59 we see that the three-loop results for F2 can be 
employed to effectively extend the main part of DIS analyses to the N'^LO at x > 10~^ where the effect 
of the unknown fourth-order splitting functions is expected to be very small. This has, for example, the 
potential for a 'gold-plated' determination of as{Mz) with an error of less than 1% from the truncation 
of the perturbation series. On the right hand side of Fig. 59 the scale uncertainty which is conventionally 
estimated by 

A/ = ^ (max[f{x,fxl)]-mm[f{x,fxl)]^ , (46) 
is plotted, where the scale varies /i^ G [Q/2, 2Q]. 




Fig. 60: The perturbative expansion up to three loops (N'^LO) of the quark (left) and gluon (right) contributions to singlet 
structure function F2. 

In the singlet case, we can study the quark and gluon contributions to the structure function F2. In 
Fig. 60 we plot the perturbative expansion up to N^LO of the quark and gluon contributions to structure 
function F2^s at the scale (42) using the distributions (44). All curves have been normalized to the 
leading-order result = (e^) qg . Fig. 60 nicely illustrates the perturbative stability of the structure 

function F2. 

Finally, we address the longitudinal structure function Fl at three loops. In the left part of Fig. 61 
we plot the singlet-quark and gluon coefficient functions c/, q and ci^g for Fl up to the third order for 
four flavors and the Og-value of Eq. (42). The curves have been divided by = as/(47r) to account 
for the leading contribution being actually of first order in the strong coupling constant Og. Both the 
second-order and the third-order contributions are rather large over almost the whole x-range. Most 
striking, however, is the behavior at very small values of x, where the anomalously small one-loop parts 
are negligible against the (negative) constant two-loop terms, which in turn are completely overwhelmed 
by the (positive) new three-loop corrections xc"^\ ~ In x + const , which we have indicated in Fig. 61. 

To assess the effect for longitudinal structure function Fl, we convolute in Fig. 61 on the right 
the coefficient functions with the input shapes Eq. (44) for = 4 flavors and the reference scale of 
Eq. (42). A comparison of the left and right plots in Fig. 61 clearly reveals the smoothening effect of 
the Mellin convolutions. For the chosen input conditions, the (mostly positive) NNLO corrections to the 
flavor-singlet Fl amount to less than 20% for 5 • 10"^ < 2; < 0.3. In data fits we expect that the parton 
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Fig. 61: The perturbative expansion to N^LO of tiie longitudinal singlet-quark and gluon coefficient functions to third order 
multiplied by x for display purposes (left) and of the quark and gluon contributions to singlet structure function Fl (right). 



distributions, in particular the gluon distribution, will further stabilize the overall NNLO/NLO ratio. 
Thus, at not too small scales, Fl is a quantity of good perturbative stability, for the x-values accessible 
at HERA, see Ref. [62] for more details. 
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4,3 Mathematical Structure of Higher Order Corrections 

The QCD anomalous dimensions and Wilson coefficients for structure functions are single scale quan- 
tities and may be expressed in simple form in Mellin space in terms of polynomials of harmonic sums 
and ration functions of the Mellin variable. Unlike the case in various calculations using representations 
in momentum-fraction (z-) space the use of multiple nested harmonic sums leads to a synchr^oniza- 
tion in language. Furthermore, significant simplifications w.r.t. the number of functions needed can be 
achieved. This is due to algebraic [116, 117] relations between these quantities, which in a similar way 
are also present between harmonic polylogarithms [118] and multiple ^-values [119]. These relations 
result from the the specific index pattern of the objects considered and their multiplication relation and 
do not refer to further more specific properties. In Table 13 we illustrate the level of complexity which 
one meets in case of harmonic sums. To three-loop order weight w=6 harmonic sums occur. The alge- 
braic relations for the whole class of harmonic sums lead to a reduction by a factor of ~ 4 (column 3). 
As it turns out, physical pseudo-observables, as anomalous dimensions and Wilson-coefficients in the 
MS scheme, to 2- resp. 3-loop order depend on harmonic sums only, in which the index {—1} never 
occurs. The algebraic reduction for this class is illustrated in column 5. We also compai^e the complexity 
of only non-alternating harmonic sums and their algebraic reduction, which is much lower. This class of 
sums is, however, not wide enough to describe the above physical quantities. In addition to the algebraic 
relations of harmonic sums structural relations exist, which reduces the basis further [120]. 
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Table 13: Number of alternating and non-alternating harmonic sums in dependence of their weight, [120]. 



Using all these relations one finds that 5 basic functions are sufficient to describe all 2-loop Wilson 
coefficients for deep-inelastic scattering [121] and further 8 [122] for the 3-loop anomalous dimensions. 
Their analytic continuations to complex values of the Mellin variable are given in [123, 124]. These 
functions are the (regularized) Mellin transforms of : 



ln(l + x) 5*1,2(3;) Li4(x) 

(47) 



1 + x' libx' libx' Xibl' 

Sl,3ix) S2,2ix) Ul{x) 52,2(-x)-Li2(-x)/2 



1 + X X ±1 1 + X X ±1 

It is remarkable, that the numerator-functions in (47) are Nielsen integrals [125] and polynomials thereof, 
although one might expect harmonic polylogarithms [118] outside this class in general. The represen- 
tation of the Wilson coefficients and anomalous dimensions in the way described allows for compact 
expressions and very fast and precise numerical evaluation well suited for fitting procedures to experi- 
mental data. 



Contributing authors: J. Blumlein, H. Bottcher, A. Guffanti, V. Ravindran 
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4.3.1 Two-loop Processes at LHC in Mellin Space 

Similar to the case of the Wilson coefficients in section 4.3 one may consider the Wilson coefficients 
for inclusive hard processes at hadron colliders, as the Drell-Yan process to O(a^) [126-128], scalar 
or pseudoscalar Higgs-boson production to 0(a;^) in the heavy-mass limit [129-134], and the 2-loop 
time-like Wilson coefficients for fragmentation [135-137]. These quantities have been analyzed in [138, 
139] w.r.t. their general structure in Mellin space. The cross section for the Drell-Yan process and Higgs 
production is given by 



(T 



dxi 

Xl 



dX2 



fa{xl,^J,'^)fb{x2,^^^ 
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2 



XiX2 jJL 



(4.48) 



with X = s/s. Here, fc{x, /i^) are the initial state parton densities and denotes the factorization scale. 
The Wilson coefficient of the process is a and is the time-like virtuality of the s-channel boson. 
Likewise, for the fragmentation process of final state partons into hadrons in pp-scattering one considers 
the double differential final state distribution 



dxd cos 9 



do^ 

dx 



+ - sm^ Q—r^ 

4 dx 



(4.49) 



Here, 



dx 



1 dz 
z 



+ <(^,M2)cf,,(.,QVM^)} 



N 



f 



p=l 



(4.50) 



In the subsystem cross-sections a the initial state parton distributions are included. denote the non- 
perturbative fragmentation functions and C^'f^{z, Q"^ /M"^) the respective time-like Wilson coefficients 
describing the fragmentaion process for a parton i into the hadron H. 

Although these Wilson coefficients are not directly related to the 2-loop Wilson coefficients for 
deeply inelastic scattering, one finds for these functions at most the same set of basic functions as given 
above. Again one obtains very fast and concise numerical programs also for these processes working in 
Mellin space, which will be well suited for inclusive analyses of experimental collider data at LHC in 
the future. 



4.3.2 Non-Singlet Parton Densities at 0(q;^) 

The precision determination of the QCD-scale Aqcd and of the idividual parton densities is an important 

issue for the whole physics programme at LHC since all measurements rely on the detailed knwoledge 

of this parameter and distribution functions. In Ref. [140] first results were reported of a world data 

analysis for charged lepton-p((i) scattering w.r.t. the flavor non-singlet sector at O(a^) accuracy. The 

flavor non-singlet distributions xuv{x, Q^) and xdi,{x, Q"^) were determined along with fully con^elated 

error bands giving parameterizations both for the values and errors of these distributions for a wide range 

in X and Q^. In Figure 62 these distributions including their error are shown. The value of the strong 

coupling constant as(Af|) was determined as 0.1135 + 0.0023 — 0.0026 (exp.) The full analysis is 

given in [141], including the determination of higher twist contributions in the large x region both for 
Ff(x,g2)andF2^(x,Q2). 



4. 3. 3 Scheme-invariant evolution for unpolarzed DIS structure functions 

The final HERA-II data on unpolarized DIS structure functions, combined with the present world data 
from other experiments, will allow to reduce the experimental error on the strong coupling constant. 
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Fig. 62: xu^ and xd^ at Qq = 4GeV^ (full lines) [140]; dashed lines [142]; dash-dotted lines [39]. 



OsiM"^), to the level of 1% [143]. On the theoretical side the NLO analyzes have intrinsic limitations 
which allow no better than 5% accuracy in the determination of Os [144]. In order to match the expected 
experimental accuracy, analyzes of DIS structure functions need then to be carried out at the NNLO- 
level. To perform a full NNLO analysis the knowledge of the 3-loop /3-function coefficient, j32, the 2- 
resp. 3-loop Wilson coefficients and the 3-loop anomalous dimensions is required. With the calcula- 
tion of the latter [102, 103], the whole scheme-independent set of quantities is known, thus allowing a 
complete NNLO study of DIS structure functions. 

Besides the standard approach solving the QCD evolution equations for parton densities in the IViS 
scheme it appears appealing to study scheme-invariant evolution equations [145]. Within this approach 
the input distributions at a scale Qq are measured experimentally. The only parameter to be determined 
by a fit to data is the QCD-scale Aqcd- To perform an analysis in the whole kinematic region the 
non-singlet [140] contribution has to be separated from the singlet terms of two measured observables. 
In practice these can be chosen to be F2{x, Q^) and dF2{x, Q^)/91n(Q2) or F2(x, Q^) and Fl{x, Q^) 
if the latter structure function is measured well enough. Either dF2{x,Q^)/dln{Q^) or Fl{x,Q'^) 
play a role synonymous to the gluon distribution while F2{x,Q'^) takes the role of the singlet-quark 
distribution compared to the standard analysis. These equations do no longer describe the evolution of 
universal quantities depending on the choice of a scheme but of process-dependent quantities which 
are observables and thus factorization scheme-indedependent. Since the respective evolution kernels 
are calculated in perturbation theory the dependence on the renormalization scale remains and becomes 
smaller with the order in the coupling constant included. 

Physical evolution kernels have been studied before in [146-148]. The 3-loop scheme-invariant 
evolution equations were solved in the massless case in [145]. This analysis is extended including the 
heavy flavor contributions at present [141]. The large complexity of the evolution kernels can only 
be handeled in IMellin space since in 2-space various inverse and direct IVIellin convolutions would be 
required numerically, causing significant accuracy and run-time problems. The inclusion of the heavy 
flavor contributions is possible using the parameterizations [149]. 

In Fig. 62 we present the scheme invariant evolution for the structure functions F2 and dF2 / dt 
to NNLO with t = — 2//3o ln(as((5^)/as((5o))- The input distribution at the reference scale are not 
extracted from data, but rather built up as a convolution of Wilson coefficients and PDFs, the latter being 
parametrised according to [150]. 

Scheme-invariant evolution equations allow a widely un-biased approach to determine the initial 
conditions for QCD evolution, which in general is a source of systematic effects which are difficult to 
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Fig. 63: NNLO scheme invariant evolution for the singlet part of the structure function F2 and its slope dF^jdt for four 
massless flavours, [145]. 



control. On the other hand, their use requires to consider all correlations of the input measurements in 
a detailed manner experimentally. At any scale mappings are available to project the observables 
evolved onto the quark-singlet and the gluon density in whatever scheme. In this way the question 
whether sign changes in the unpolarized gluon distribution in the MS scheme do occur or do not occur 
in the small x region can be answered uniquely. As in foregoing analyses [140, 151] correlated error 
propagation throughout the evolution is being performed. 



92 



4.4 Updated reference results for the evolution of parton distributions 

In this contribution we update and extend our benchmark tables, first presented in the report of the 
QCD/SM working group at the 2001 Les Houches workshop [150], for the evolution of parton distri- 
butions of hadrons in perturbative QCD. Since then the complete next-to-next-to-leading order (NNLO) 
splitting functions have been computed [102, 103], see also section 4.2. Thus we can now replace the 
NNLO results of 2001 which were based on the approximate splitting functions of Ref. [152]. Further- 
more we now include reference tables for the polarized case treated in neither Ref. [150] nor the earlier 
study during the 1995/6 HERA workshop [153]. Since the spin-dependent NNLO splitting functions are 
still unknown, we have to restrict ourselves to the polarized leading-order (LO) and next-to-leading-order 
(NLO) evolution. 

As in Ref. [150], we employ two entirely independent and conceptually different FORTRAN pro- 
grams. At this point, the x-space code of G.S. is available from the author upon request, while the Mellin- 
space program of A.V. has been published in Ref. [154]. The results presented below correspond to a di- 
rect iterative solution of the N™LO evolution equations for the parton distributions fp{x, /if ) = p{x, fij), 
where p = qi, qi,g with i = 1, . . . , A'^f, 



din /if 



E -^'(f^) , V E^?(^'^) fAv^^^h (4-51) 
(=0 ^' p' \y f^r / 



p 

with the strong coupling, normalized as Ug = as/ (47r), given in terms of 

da, 



/3n-lo(«s) = -E«i+'A (4.52) 



with (3o = 11 — 2/3A'f etc. fi^ and /if represent the renormalization and mass-factorization scales in the 
MS scheme. The reader is referred to Refs. [150, 154] for the scale dependence of the splitting functions 
p(') and a further discussion of our solutions of Eqs. (4.51) and (4.52). 

For the unpolarized case we retain the initial conditions as set up at the Les Houches meeting: The 
evolution is started at 



2GeV2 . (4.53) 



Roughly along the lines of the CTEQ5M parametrization [155], the input distributions are chosen as 

xu^{x, /i^o) = 5.107200 x°-* (1 - xf 

xd^{x, i4fl) = 3.064320 x°-^ (1 - x)"" 

x5(x,/ifo) = 1.700000 x"°-^ (1 -x)^ (4.54) 

xd{x,i4fl) = . 1939875 x"°-^ (1 -x)^ 

xu(x,/if o) = (1 - x) xJ(x,/if o) 

xs(x,/ifQ) = xs(x,/ifQ) = 0.2 x(n + J)(x, /if q) 

where, as usual, v = Qi — Qi- The running couplings are specified by Eq. (4.52) and 

as(/i? = 2GeV2) = 0.35 . (4.55) 

For simpUcity initial conditions (4.54) and (4.55) are employed regardless of the order of the evolution 
and the (fixed) ratio of the renormaUzation and factorization scales. 

For the evolution with a fixed number A'^f > 3 of quark flavours the quark distributions not spec- 
ified in Eq. (4.54) ai^e assumed to vanish at /if q, and Eq. (4.55) is understood to refer to the chosen 

"■^Contributing authors: G.P. Salam, A. Vogt 
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value of A^f . For the evolution with a variable A^f = 3 . . . 6, Eqs. (4.53) and (4.54) always refer to thr^ee 
flavours. A'f is then increased by one unit at the heavy-quark pole masses taken as 

mc = Hffl , = 4.5 GeV^ , rrit = 175 GeV^ , (4.56) 

i.e., Eqs. (4.51) and (4.52) are solved for a fixed number of flavours between these thresholds, and the 
respective matching conditions are invoked at //f = , h = c, b, t. The matching conditions for the 
unpolarized parton distributions have been derived at NNLO in Ref. [156], and were first implemented in 
an evolution program in Ref. [157]. Note that, while the parton distributions are continuous up to NLO 
due to our choice of the matching scales, as is discontinuous at these flavour thresholds already at this 
order for fi^ ^ fi^, see Refs. [158, 159]. Again the reader is referred to Refs. [150, 154] for more details. 

Since the exact NNLO splitting functions P^"^^ are rather lengthy and not directly suitable for 
use in a Mellin-space program (see, however, Ref. [124]), the reference tables shown below have been 
computed using the parametrizations (4.22) - (4.24) of Ref. [102] and (4.32) - (4.35) of Ref. [103]. 
Likewise, the operator matrix element A^^ entering the NNLO flavour matching is taken from Eq. (3.5) 
of Ref. [154]. The relative eiTor made by using the parametrized splitting functions is illustrated in 
Fig. 64. It is generally well below 10~^, except for the very small sea quark distributions at very large x. 

Eqs. (4.53), (4.55) and (4.56) 
Eq. (4.54) replaced by the sufficiently 

xg 
xd 
xs 

As Eq. (4.54) in the unpolarized case, this input is employed regai^dless of the order of the evolution. 

As in Ref. [150], we have compared the results of our two evolution programs, under the conditions 
specified above, at 500 x-/i£ points covering the range 10-8 < X < 0.9 and 2 GeV^ < ^4 < 10^ GeV^. 
A representative subset of our results at ^u^ = 10^ GeV^, a scale relevant to high-£'r jets and close to 
m^, m| and, possibly, m"^^^^^, is presented in Tables 14-18. These results are given in terms of the 
valence distributions, defined below Eq. (4.54), L± = d±u, and the quark-antiquark sums q+= q — q 
for q = s, c and, for the variable- A'^f case, b. 

For compactness an abbreviated notation is employed throughout the tables, i.e., all numbers a- lO'' 
are written as a^. In the vast majority of the x-^f points our results are found to agree to all five figures 
displayed, except for the tiny NLO and NNLO sea-quark distributions at x = 0.9, in the tables. Entries 
where the residual offsets between our programs lead to a different fifth digit after rounding are indicated 
by the subscript In these cases the number with the smaller modulus is given in the tables. 

The approximate splitting functions [152], as mentioned above employed in the previous version 
[150] of our reference tables, have been used in (global) NNLO fits of the unpolarized parton distributions 
[39,40], which in turn have been widely employed for obtaining NNLO cross sections, in particular for 
W and Higgs production. The effect of replacing the approximate results by the full splitting functions 
[102, 103] is illustrated in Figure 65. Especially at scales relevant to the above-mentioned processes, 
the previous approximations introduce an error of less than 0.2% for x > lO^'^, and less than 1% even 
down to X ~ IQ-^. Consequently the splitting-function approximations used for the evolution the parton 
distributions of Refs. [39,40] are confirmed to a sufficient accuracy for high-scale processes at the LHC. 

The unchanged unpolarized LO and NLO reference tables of Ref. [150] are not repeated here. 
Note that the one digit of the first (FFN) value was mistyped in the header of Table 1 in that report , 

"■'We thank H. Bottcher and J. Bliimlein for pointing tliis out to us. 



are used for the (longitudmally) 
realistic toy input [154] 

= +1.3x°-^(l-x)^(l + 3x) 

= -0.5x°-^(l-x)^(l + 4x) 

= +1.5 x^-^ (1 - xf 

= xu = -0.05 x°-^ (1 -a;)^ 

= xs = +0.5 xd . 



polarized case as well, where 



(4.57) 
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Fig. 64: Relative effects of using the parametrized three-loop splitting functions of Refs. [102, 103], instead of the exact 
expressions from the same source, on the NNLO evolution for the input (4.53) - (4.55) at two representative values of jj, = 
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the coiTect value can be found in Table 3 below. 
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Table 14: Reference results for the A'f = 4 next-next-to-leading-order evolution for the initial conditions (4.53) - (4.55). The 
corresponding value of the strong coupling is Qs(A*r ~ 10* GeV^) — 0.110141. The valence distributions and Cy are equal 
for the input (4.54). The notation is explained below Eq. (4.54) and in the paragraph below Eq. (4.57). 



NNLO, Aff = 4 , /xf = 10^ GeV^ 


X 








2xL+ 


XSy 


XS+ 


XC+ 


xg 




10-7 


1.5287" 


-4 


1.0244" 


-4 


5.7018" 


-6 


1.3190-^2 


3.1437" 


-5 


6.4877+1 


6.4161+1 


9.9763+2 


10-6 


6.9176" 


-4 


4.4284" 


-4 


2.5410" 


-5 


6.8499-^1 


9.4279" 


-5 


3.3397+1 


3.2828+1 


4.9124+2 


10-5 


3.0981" 


-3 


1.8974" 


-3 


1.0719" 


-4 


3.3471"^^ 


2.2790" 


-4 


1.6059+1 


1.5607+1 


2.2297+2 


10-4 


1.3722" 


-2 


8.1019" 


-3 


4.2558" 


-4 


1.5204-^1 


3.6644" 


-4 


7.0670+° 


6.7097+° 


9.0668+1 


10-3 


5.9160" 


-2 


3.4050" 


-2 


1.6008" 


-3 


6.3230"^° 


1.4479" 


-4 


2.7474+° 


2.4704+° 


3.1349+1 


10-2 


2.3078" 


-1 


1.2919" 


-1 


5.5688" 


-3 


2.2752"^° 


-5.7311" 


-4 


8.5502-1 


6.6623-1 


8.1381+° 


0.1 


5.5177" 


-1 


2.7165" 


-1 


1.0023" 


-2 


3.9019-1 


-3.0627" 


-4 


1.1386-1 


5.9773-2 


9.0563-1 


0.3 


3.5071" 


-1 


1.3025" 


-1 


3.0098" 


3 


3.5358-2 


-3.1891" 


-5 


9.0480-3 


3.3061-3 


8.4186-2 


0.5 


1.2117" 


-1 


3.1528" 


-2 


3.7742" 


-4 


2.3867-3 


-2.7215" 


-6 


5.7965-4 


1.7170-4 


8.1126-3 


0.7 


2.0077" 


-2 


3.0886" 


-3 


1.3434" 


-5 


5.4244-5 


-1.0106" 


-7 


1.2936-5 


3.5304-6 


3.8948-4 


0.9 


3.5111" 


-4 


1.7783" 


-5 


8.651" 


-9 


2.695-^ 


-1.476- 


10 


7.132-9 


2.990-9 


1.2136-6 




10-7 


1.3416" 


-4 


8.7497" 


-5 


4.9751" 


-6 


1.3020-^2 


2.1524" 


--5 


6.4025+1 


6.3308+1 


1.0210+3 


10-6 


6.2804" 


-4 


3.9406" 


-4 


2.2443" 


-5 


6.6914"^^ 


6.5149" 


-5 


3.2602+1 


3.2032+1 


4.9626+2 


10-5 


2.9032" 


-3 


1.7575" 


-3 


9.6205" 


-5 


3.2497+1 


1.5858" 


-4 


1.5570+1 


1.5118+1 


2.2307+2 


10-4 


1.3206" 


-2 


7.7673" 


-3 


3.9093" 


-4 


1.4751-^1 


2.5665" 


-4 


6.8388+° 


6.4807+° 


9.0162+1 


10-3 


5.8047" 


-2 


3.3434" 


-2 


1.5180" 


-3 


6.1703+0 


1.0388" 


~4 


2.6695+° 


2.3917+° 


3.1114+1 


10-2 


2.2930" 


-1 


1.2857" 


-1 


5.4626" 


-3 


2.2492+° 


-3.9979" 


-4 


8.4058-1 


6.5087-1 


8.0993+° 


0.1 


5.5428" 


-1 


2.7326" 


-1 


1.0072" 


-2 


3.9297-1 


-2.1594" 


-4 


1.1439-1 


5.9713-2 


9.0851-1 


0.3 


3.5501" 


-1 


1.3205" 


-1 


3.0557" 


-3 


3.6008-2 


-2.2632" 


-5 


9.2227-3 


3.3771-3 


8.5022-2 


0.5 


1.2340" 


-1 


3.2166" 


-2 


3.8590" 


-4 


2.4459-3 


-1.9420" 


-6 


5.9487-4 


1.7699-4 


8.2293-3 


0.7 


2.0597" 


-2 


3.1751" 


-3 


1.3849" 


-5 


5.5722-5 


-7.2616" 


--8 


1.3244-5 


3.5361-6 


3.9687-4 


0.9 


3.6527" 


-4 


1.8544" 


-5 


9.050" 


-9 


2.663-^ 


-1.075- 


10 


6.713-9 


2.377-9 


1.2489-6 




10-7 


1.7912" 


-4 


1.2521" 


-4 


6.4933; 


-6 


1.2714+2 


4.9649" 


-5 


6.2498+1 


6.1784+1 


9.2473+2 


10-6 


7.7377" 


-4 


5.1222" 


-4 


2.8719" 


-5 


6.7701+1 


1.4743" 


-4 


3.2999+1 


3.2432+1 


4.6863+2 


10-5 


3.3184" 


-3 


2.0760" 


-3 


1.1977" 


-4 


3.3644+1 


3.5445" 


-4 


1.6147+1 


1.5696+1 


2.1747+2 


10-4 


1.4184" 


-2 


8.4455" 


-3 


4.6630" 


-4 


1.5408+1 


5.6829" 


-4 


7.1705+° 


6.8139+° 


8.9820+1 


10-3 


5.9793" 


-2 


3.4418" 


-2 


1.6996" 


-3 


6.4042+° 


2.2278" 


-4 


2.7892+° 


2.5128+° 


3.1336+1 


10-2 


2.3106" 


-1 


1.2914" 


-1 


5.7016" 


-3 


2.2876+° 


-8.9125" 


-4 


8.6205-1 


6.7377-1 


8.1589+° 


0.1 


5.5039" 


-1 


2.7075" 


-1 


1.0031" 


-2 


3.8850-1 


-4.7466" 


-4 


1.1332-1 


5.9489-2 


9.0795-1 


0.3 


3.4890" 


-1 


1.2949" 


-1 


2.9943" 


-3 


3.5090-2 


-4.9304" 


-5 


8.9667-3 


3.2670-3 


8.4309-2 


0.5 


1.2026" 


-1 


3.1269" 


-2 


3.7428" 


-4 


2.3729-3 


-4.1981" 


-6 


5.7783-4 


1.7390-4 


8.1099-3 


0.7 


1.9867" 


-2 


3.0534" 


-3 


1.3273" 


-5 


5.4635-5 


-1.5541" 


-7 


1.3275-5 


3.9930-6 


3.8824-4 


0.9 


3.4524" 


-4 


1.7466" 


-5 


8.489" 


-9 


3.030-s 


-2.255" 


10 


8.863-9 


4.8O3-9 


1.2026-6 
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Table 15: As Table 14, but for the variable- A^f evolution using the flavour matching conditions of Ref. [156, 158, 159]. The 
corresponding values for the strong coupling as(M? ~ 10^ GeV^) are given by 0.115818, 0.115605 and 0.115410 for = 
0.5, 1 and 2, respectively. For brevity the small, but non-vanishing valence distributions s„, c„ and 6„ are not displayed. 











NNLO, A^f 


= 3 ...5, 


^4 = 104 GeV2 






X 


XUju 






2xL^ 


XS + 






xg 




10-7 


1.5978" 


-4 


1.0699" 


-5 


6.0090" 


-6 


1.3916+2 


6.8509+1 


6.6929+1 


5.7438+1 


9.9694+3 


10-6 


7.1787" 


-4 


4.5929" 


-4 


2.6569" 


-5 


7.1710"^^ 


3.5003+1 


3.3849+1 


2.8332+1 


4.8817+2 


10-^ 


3.1907" 


-3 


1.9532" 


-3 


1.1116" 


^4 


3.4732+1 


1.6690+1 


1.5875+1 


1.2896+1 


2.2012+2 


10-4 


1.4023" 


^2 


8.2749" 


-3 


4.3744" 


^4 


1.5617+1 


7.2747+° 


6.7244+0 


5.2597+° 


8.8804+1 


10-3 


6.0019" 


-2 


3.4519" 


-2 


1.6296" 


-3 


6.4173+° 


2.7954+° 


2.4494+° 


1.8139+° 


3.0404+1 


10-2 


2.3244" 


-1 


1.3000" 


-1 


5.6100" 


-3 


2.2778+° 


8.5749-1 


6.6746-1 


4.5073-1 


7.7912+° 


0.1 


5.4993" 


-1 


2.7035" 


-1 


9.9596" 


-3 


3.8526-1 


1.1230-1 


6.4466-2 


3.7280-2 


8.5266-1 


0.3 


3.4622" 


-1 


1.2833" 


-1 


2.9572" 


-3 


3.4600-2 


8.8410-3 


4.0134-3 


2.1047-3 


7.8898-2 


0.5 


1.1868" 


-1 


3.0811" 


-2 


3.6760" 


-4 


2.3198-3 


5.6309-4 


2.3752-4 


1.2004-4 


7.6398-3 


0.7 


1.9486" 


-2 


2.9901" 


-3 


1.2957" 


-5 


5.2352-5 


1.2504-5 


5.6038-6 


2.8888-6 


3.7080-4 


0.9 


3.3522" 


-4 


1.6933" 


-5 


8.209" 


'9 


2.574-8 


6.856-*^ 


4.337-9 


2.679-9 


1.1721-6 


/fir =2^? 


10-7 


1.3950" 


-4 


9.0954" 


-5 


5.2113" 


-6 


1.3549+2 


6.6672+1 


6.5348+1 


5.6851+1 


1.0084+3 


10-6 


6.4865" 


-4 


4.0691" 


-4 


2.3344" 


-5 


6.9214+1 


3.3753+1 


3.2772+1 


2.7818+1 


4.8816+2 


10-^ 


2.9777" 


-3 


1.8020" 


-3 


9.9329" 


-5 


3.3385+1 


1.6015+1 


1.5306+1 


1.2601+1 


2.1838+2 


10-4 


1.3452" 


-2 


7.9078" 


-3 


4.0036" 


-4 


1.5035+1 


6.9818+° 


6.4880+° 


5.1327+° 


8.7550+1 


10-3 


5.8746" 


-2 


3.3815" 


-2 


1.5411" 


-3 


6.2321+° 


2.7012+° 


2.3747+° 


1.7742+° 


3.0060+1 


10"2 


2.3063" 


-1 


1.2923" 


-1 


5.4954" 


-3 


2.2490+° 


8.4141-1 


6.5083-1 


4.4354-1 


7.7495+° 


0.1 


5.5279" 


^1 


2.7222" 


-1 


1.0021" 


-2 


3.8897-1 


1.1312-1 


6.2917-2 


3.7048-2 


8.5897-1 


0.3 


3.5141" 


-1 


1.3051" 


-1 


3.0134" 


-3 


3.5398-2 


9.0559-3 


3.8727-3 


2.0993-3 


8.0226-2 


0.5 


1.2140" 


^1 


3.1590" 


-2 


3.7799" 


-4 


2.3919-3 


5.8148-4 


2.2376-4 


1.1918-4 


7.8098-3 


0.7 


2.0120" 


-2 


3.0955" 


-3 


1.3462" 


-5 


5.4194-5 


1.2896-5 


5.0329-6 


2.8153-6 
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Table 16: Reference results for the Nf — 4 (FFN) and the variable- A'f (VFN) polarized leading-order evolution of the initial 
distributions (4.57), shown together with these boundary conditions. The respective values for aa(/i? = = 10* GeV^) read 
0.117574 (FFN) and 0.122306 (VFN). The notation is the same as for the unpolarized case. 
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Table 17: Reference results for the polarized next-to-leading-order polarized evolution of the initial distributions (4.57) with 
A^f = 4 quark flavours. The corresponding value of the strong coupling is Qs = 10"* GeV^) = 0.110902. As in the 
leading-order case, the valence distributions s„ and c„ vanish for the input (4.57). 











Pol. NLO, iVf 


= 4, ^2 = 


10 


4GeV2 












X 


XUu 






2xL+ 




XC+ 


xg 






























10 ' 


6.7336-^ 


-5 


-2.5747" 


-5 


-1.1434-4 


-5.2002" 


-3 


-2.0528"" 


3 


-1.5034" 


3 


2.6955" 


-2 


10-6 


3.1280" 


-4 


-1.1938" 


-4 


-2.3497-4 


-1.0725" 


-2 


-4.2774- 


3 


-3.1845" 


3 


6.5928" 


-2 


10-5 


1.4180" 


-3 


-5.3982" 


-4 


-4.8579-4 


-1.9994" 


-2 


-7.8594" 


-3 


-5.6970" 


3 


1.4414" 


-1 


10-4 


6.2085" 


-3 


-2.3546" 


-3 


-9.8473-4 


-3.1788" 


-2 


-1.1749- 


2 


-7.5376" 


3 


2.7537" 


-1 


10-3 


2.5741" 


-2 


-9.7004" 


-3 


-1.8276-3 


-3.8222" 


-2 


-1.1427- 


2 


-3.6138" 


3 


4.3388" 


-1 


10-2 


9.6288" 


-2 


-3.5778" 


'2 


-2.6427-3 


-2.6437" 


-2 


-1.2328" 


3 


1.0869" 


2 


4.8281" 


-1 


0.1 


2.5843" 


-1 


-8.9093" 


'2 


-1.4593-3 


-7.5546" 


-3 


3.4258" 


3 


1.0639" 


2 


2.0096" 


-1 


0.3 


2.1248" 


-1 


-5.8641" 


-2 


-1.9269-4 


-1.2210" 


-3 


3.5155" 


4 


1.3138" 


3 


3.4126" 


-2 


0.5 


8.9180" 


-2 


-1.7817" 


-2 


-1.3125-5 


-9.1573" 


-5 


1.9823" 


5 


8.5435" 


5 


4.5803" 


-3 


0.7 


1.7300" 


-2 


-2.0885" 


-3 


-2.3388-^^ 


-1.9691" 


-6 


1.8480" 


7 


1.3541" 


6 


2.9526" 


-4 


0.9 


3.4726" 


-4 


-1.4028" 


-5 


-4.407-^^ 


-4.247" 


-9 


-1.903" 


9 


-1.683" 


9 


1.2520" 


-6 




lo-'^ 


6.1781" 


-5 


-2.3641" 


-5 


-1.1137-4 


-4.6947" 


-3 


-1.8092" 


3 


-1.2695" 


3 


2.2530" 


-2 


10-*^ 


2.8974" 


-4 


-1.1068" 


-4 


-2.2755-4 


-9.8528" 


-3 


-3.8580" 


3 


-2.7838" 


3 


5.7272; 


-2 


10-5 


1.3281" 


'3 


-5.0612" 


-4 


-4.6740-4 


-1.8799" 


-2 


-7.2908" 


3 


-5.1629" 


3 


1.2975" 


-1 


10-4 


5.8891" 


'3 


-2.2361" 


-3 


-9.4412-4 


-3.0787" 


-2 


-1.1292" 


2 


-7.1363" 


3 


2.5644" 


-1 


10-3 


2.4777" 


-2 


-9.3502" 


-3 


-1.7632-3 


-3.8610" 


^2 


-1.1658" 


2 


-3.9083" 


3 


4.1725" 


-1 


10-2 


9.4371" 


-2 


-3.5129" 


-2 


-2.6087-3 


-2.8767" 


^2 


-2.3430; 


3 


9.7922; 


3 


4.7804" 


-1 


0.1 


2.6008" 


-1 


-8.9915" 


-2 


-1.4923-3 


-8.3806" 


-3 


3.1932" 


3 


1.0585" 


2 


2.0495" 


-1 


0.3 


2.1837" 


-1 


-6.0497" 


-2 


-2.0143-4 


-1.2157" 


-3 


3.9810" 


-4 


1.4042" 


3 


3.5366" 


-2 


0.5 


9.3169" 


^2 


-1.8699" 


-2 


-1.3954-5 


-7.9331" 


-5 


3.0091" 


5 


9.9849" 


5 


4.7690" 


-3 


0.7 


1.8423" 


^2 


-2.2357" 


-3 


-2.5360-^^ 


-1.0062" 


-6 


7.6483" 


7 


2.0328" 


6 


3.0796" 


-4 


0.9 


3.8293" 


-4 


-1.5559" 


-5 


-4.952-11 


-1.955" 


-9 


-7.298-1° 


-4.822-1° 


1.3247" 


-6 














= 1/2%' 
















10-^ 


7.4443" 


-5 


-2.8435" 


-5 


-I.I8I5-4 


-5.7829" 


-3 


-2.3341" 


3 


-1.7739" 


3 


3.2071" 


-2 


10-6 


3.4143" 


~4 


-1.3016" 


-4 


-2.4482-4 


-1.1668" 


-2 


-4.7305" 


3 


-3.6168" 


3 


7.5123" 


-2 


10-5 


1.5256" 


-3 


-5.8002" 


-4 


-5.1085-4 


-2.1193" 


-2 


-8.4295" 


3 


-6.2295; 


3 


1.5788" 


-1 


10-4 


6.5726" 


-3 


-2.4891" 


-3 


-1.0409-3 


-3.2697" 


-2 


-1.2166" 


2 


-7.8952" 


3 


2.9079" 


-1 


10-3 


2.6766" 


-2 


-1.0070" 


-2 


-1.9171-3 


-3.7730" 


-2 


-1.1160" 


2 


-3.2890; 


3 


4.4380" 


-1 


10-2 


9.8073" 


-2 


-3.6370" 


-2 


-2.6942-3 


-2.4056" 


-2 


-1.2354; 


4 


1.1929" 


2 


4.8272" 


-1 


0.1 


2.5628" 


-1 


-8.8133" 


-2 


-1.4304-3 


-6.9572" 


-3 


3.5561" 


3 


1.0604" 


2 


1.9831" 


-1 


0.3 


2.0709" 


-1 


-5.6988" 


-2 


-1.8541-4 


-1.3308" 


-3 


2.5993" 


4 


1.1855" 


3 


3.3524" 


-2 


0.5 


8.5835" 


-2 


-1.7089" 


-2 


-1.2463-5 


-1.1920" 


-4 


2.6972; 


6 


6.4995" 


5 


4.5044" 


-3 


0.7 


1.6405" 


-2 


-1.9723" 


-3 


-2.1859-7 


-3.6817" 


-6 


-7.4795; 


7 


3.4496" 


7 


2.9100" 


-4 


0.9 


3.2011" 


-4 


-1.2870" 


-5 


-4.000-11 


-8.173" 


-9 


-3.886" 


9 


-3.686" 


9 


1.2230" 


-6 



100 



Table 18: As Table 17, but for the variable- A^f evolution using Eqs. (4.53), (4.54) and (4.57). The corresponding values for the 
strong coupling = GeV^) are given by 0.116461, 0.116032 and 0.115663 for /ij? //if = 0.5, 1 and 2, respectively. 
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Fig. 65: Relative errors made by using the previous average approximations [152] for the three-loop splitting functions (used, 
e.g., in Refs. [39,40]) instead of the full results [102, 103], on the NNLO evolution of the input (4.53) - (4.55) at fi^ = /if. 
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Fig. 66: Probing (a) a valence parton in the proton and (b) a sea parton in a hadronic fluctuation (letters are four-momenta) 
resulting in (c) parton distributions at the starting scale Qq. 



4.5 Non-perturbative x-shape of PDFs 

The x-shape of parton density functions at a low scale Qq is due to the dynamics of the bound state 
proton and is hence an unsolved problem of non-perturbative QCD. Usually this is described by parame- 
terizations of data using more or less arbitrary functional forms. More understanding can be obtained by 
a recently developed physical model [160], which is phenomenologically successful in describing data. 

The model gives the four-momentum /c of a single probed valence parton (Fig. 66a) by assuming 
that, in the nucleon rest frame, the shape of the momentum distribution for a parton of type i and mass 
rrii can be taken as a Gaussian fi{k) = N{ai, mi) exp {— [(fco — mj)^ + k"^ + ky + fc^] /2o'f }, which 
may be motivated as a result of the many interactions binding the parton in the nucleon. The width of 
the distribution should be of order hundred MeV from the Heisenberg uncertainty relation applied to the 
nucleon size, i.e. ai = l/dj^. The momentum fraction x of the parton is then defined as the light-cone 
fraction x = k^ /p^ and is therefore invariant under longitudinal boosts (e.g. to the infinite momentum 
frame). Constraints are imposed on the final-state momenta to obtain a kinematicaUy allowed final state, 
which also ensures that < a; < 1 and fi{x) — > for x — > 1. 

The sea partons are obtained using a hadronic basis for the non-perturbative dynamics of the bound 
state proton and considering hadronic fluctuations 



Probing a parton i in a hadron H of a. baryon-meson fluctuation \BM) (Fig. 66b) gives a sea parton 
with light-cone fraction x = xr Xi of the target proton. The momentum of the probed hadron is given 
by a similar Gaussian, but with a separate width parameter an- Also here, kinematic constraints ensure 
physically allowed final states. 

Using a Monte Carlo method the resulting valence and sea parton x-distributions are obtained 
without approximations. These apply at a low scale Qq and the distributions at higher Q'^ are obtained 
using perturbative QCD evolution at next-to-leading order. To describe all parton distributions (Fig. 66c), 
the model has only four shape parameters and three normalization parameters, plus the starting scale: 



These are determined from fits to data as detailed in [160] and illustrated in Fig. 67. The model repro- 
duces the inclusive proton structure function and gives a natural explanation of observed quark asymme- 
tries, such as the difference between the up and down valence distributions and between the anti-up and 

'''^Contributing author: G. Ingelman 



\p) = ao\Po) + apnolpT^ ) + an7r+l"'7r+) + . . . + aAK\^K+) + ... 



(4.58) 



a, 



- 230 MeV ad = 170 MeV ag = 77 MeV an = 100 MeV 

:2 = 0.45 a2 ^ q^^^^ ^2^ ^ g^Qg ^ 0^75 Q^y 



(4.59) 
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Fig. 67: F2{x, Q^) from HI compared to the model with ±50% variation of the width parameter ag of the gluon distribution. 



anti-down sea quark distributions. Moreover, its asymmetry in the momentum distribution of strange and 
anti-strange quarks in the nucleon is large enough to reduce the NuTeV anomaly to a level which does 
not give a significant indication of physics beyong the Standard Model. 

Recent fits of PDF's at very low x and have revealed problems with the gluon density, which 
in some cases even becomes negative. The reason for this is that the DGLAP evolution, driven primarily 
by the gluon at small x, otherwise gives too large parton densities and thereby a poor fit to F2 in the 
genuine DIS region at larger Q^. It has been argued [161] that the root of the problem is the application 
of the formalism for DIS also in the low-Q^ region, where the momentum transfer is not large enough 
that the parton structure of the proton is clearly resolved. The smallest distance that can be resolved 
is basically given by the momentum transfer of the exchanged photon through d = where 
d is in Fermi if is in GeV^. This indicates that partons are resolved only for > 1 GeV^. For 
Q2 < iGeV^ there is no hai^d scale involved and a parton basis for the description is not justified. 
Instead, the interaction is here of a soft kind between the nearly on-shell photon and the proton. The 
cross section is then dominated by the process where the photon fluctuates into a virtual vector meson 
state which then interacts with the proton in a strong interaction. The quantum state of the photon can 
be expressed as I7) = C0I70) + J2v tI^I^) /mo '^'^(' ' ' )• The sum is over V = ,uj, (/)... as in 
the original vector meson dominance model (VDM), whereas the generalised vector meson dominance 
model (GVDM) also includes the integral over a continuous mass spectrum (not written out explicitly 
here). 

Applied to ep at low this leads to the expression [161] 




Q2 + mg+^^P^"^' + ^) 



m 

X' 



(4.60) 



where the hadronic cross-section cj{ip — > X) = Ais'^ + BiS ^ w Ais'^ w Ai{Q'^/xy has been used 
for the small-x region of interest. The parameters involved are all essentially known from GVDM phe- 
nomenology. With e = 0.091, ^ = 0.34, mo = 1.5 GeV and A^ = 71 ph, this GVDM model gives a 
good fit (x^/d.o.f. = 87/66 = 1.3) as illustrated in Fig. 68. Using this model at very low in com- 
bination with the normal parton density approach at larger it is possible to obtain a good description 
of data over the full range [161]. This involves, however, a phenomenological matching of these two 
approaches, since a theoretically well justified combination is an unsolved problem. 

Neglecting the GVDM component when fitting PDF's to data at small may thus lead to an 
improper gluon distribution, which is not fully universal and therefore may give incorrect results when 
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Fig. 68: F2 data at low from ZEUS compared to the full GVDM in eq. (4.60) (full curves), when excluding the longitudinal 
contribution of the continuum (^c = 0) and excluding the continuous contribution altogether (setting rc = 0) giving VDM. 



used for cross section calculations at LHC. 
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4,6 Towards precise determination of the nucleon PDFs 

The nucleon parton distribution functions (PDFs) available to the moment are extracted from the rather 
limited set of experimental distributions (the deep-inelastic scattering (DIS) structure functions, the 
Drell-Yan (DY) and jet production cross sections). Other high-energy processes potentially could pro- 
vide additional constraints on PDFs, however insufficient theoretical understanding does not allow to use 
those data without risk of having uncontrolled theoretical inaccuracies. Even for the case of the exist- 
ing global fits of the PDFs performed by the MRST and CTEQ groups missing next-to-next-to-leading 
(NNLO) order QCD corrections to the Drell-Yan and jet production cross sections are not small as com- 
pared to the accuracy of the corresponding data used and therefore might give non-negligible effect. In 
this section we outline progress in the QCD fits with consistent account of the NNLO coiTcctions. 

4.6.1 Impact of the NNLO evolution kernel fixation on PDFs 

In order to allow account of the NNLO corrections in the fit of PDFs one needs analytical expressions 
for the 3-loop corrections to the QCD evolution kernel. Until recent times these expressions were known 
only in the approximate form of Ref. [152] derived from the partial information about the kernel, includ- 
ing the set of its Mellin moments and the low-x asymptotics [107, 114, 115] However with the refined 
calculations of Ref. [102, 103] the exact expression for the NNLO kernel has been available. These im- 
provement is of particular importance for analysis of the low-x data including the HERA ones due to 
general rise of the high-order QCD coiTcction at low x. We illustrate impact of the NNLO evolution 
kernel validation on PDFs using the case of fit to the global DIS data [41,53, 162-165]. The exact NNLO 
corrections to the DIS coefficient functions are know [100, 166] that allowed to perform approximate 
NNLO fit of PDFs to these data [40] using the approximate NNLO corrections to the evolution kernel 
of Ref. [152]. Taking into account exact NNLO evolution kernel the analysis of Ref. [40] was updated 
recently to the exact NNLO case [167]. 

The gluon distributions at small x obtained in these two variants of the fit are compared in Fig.69. 
With the exact NNLO corrections the QCD evolution of gluon distribution at small x gets weaker and 
as a result at small x/Q the gluon distribution obtained using the precise NNLO kernel is quite dif- 
ferent from the approximate one. In particular, the approximate NNLO gluon distribution is negative 

^ 1-3 GeV^, while the precise one remains positive even below Q"^ = 1 GeV^. For the NLO 
case the positivity of gluons at small x/Q is even worse than for the approximate NNLO case due to 
the approximate NNLO coiTcctions dampen the gluon evolution at small x too, therefore account of the 
NNLO corrections is crucial in this respect, (cf. discussion of Ref. [168]). Positivity of the PDFs is not 
mandatory beyond the QCD leading order, however it allows probabilistic interpretation of the parton 
model and facilitates modeling of the soft processes, such as underlying events in the hadron-hadron col- 
lisions at LHC. The change of gluon distribution at small x/Q as compared to the fit with approximate 
NNLO evolution is rather due the change in evolution kernel than due to shift in the fitted parameters 
of PDFs. This is clear from comparison of the exact NNLO gluon distribution to one obtained from the 
approximate NNLO fit and evolved to low Q using the exact NNLO kernel (see Fig.69). In the vicinity 
of crossover in the gluon distribution to the negative values its relative change due to variation of the 
evolution kernel is quite big and therefore further fixation of the kernel at small x discussed in Ref. [169] 
might be substantial for validation of the PDFs at low x/Q. For the higher-mass kinematics at LHC 
numerical impact of the NNLO kernel update is not dramatic. Change in the Higgs and W/Z bosons 
production cross sections due to more precise definition of the NNLO PDFs is comparable to the enws 
coming from the PDFs uncertainties, i.e. at the level of several percent. 

^'Contributing author: S. I. Alekhin 
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Fig. 69: The gluon distributions obtained in the different variants of PDFs fit to the DIS data (soHd: the fit with exact NNLO 
evolution; dashes: the fit with approximate NNLO evolution; dots: the approximate NNLO gluons evolved with the exact 
NNLO kernel; dashed-dots: the NLO fit). 



4.6.2 NNLO fit of PDFs to the combined DIS and Drell-Yan data 

The DIS process provide very clean source of information about PDFs both from experimental and 
theoretical side, however very poorly constrains the gluon and sea distributions at x > 0.3. The well 
known way to improve precision of the sea distributions is to combine DIS data with the Drell-Yan ones. 
The cross section of process NN — > I'^l" reads 

(^DY oc ^ [qi{xi)qi{x2) + qi{x2)qi{xi)\ + higher-order terms, 

i 

where q{q)i are the quarks(antiquarks) distribution and xi^2 give the momentum fractions carried by each 
of the colliding partons. The quark distributions are determined by the DIS data with the precision of 
several percent in the wide region of x and therefore precision of the sea distribution extracted from the 
combined fit to the DIS and DY data is basically determined by the latter. The Fermilab fixed-target 
experiments provide measurements of the DY cross sections for the isoscalar target [170] and the ratio 
of cross sections for the deuteron and proton targets [71] with the accuracy better than 20% at x < 0.6. 
Fitting PDFs to these data combined with the global DIS data of Ref. [41,53, 162-165] we can achieve 
comparable precision in the sea distributions. Recent calculations of Ref. [171] allow to perform this 
fit with full account of the NNLO correction. Using these calculations the DY data of Refs. [71, 170] 
were included into the NNLO fit of Ref. [167] that leads to significant improvement in the precision of 
sea distributions (see Fig. 70). Due to the DY data on the deuteron/proton ratio the isospin asymmetry 
of sea is also improved. It is worth to note that the precision achieved for the total sea distribution is 
in good agreement to the rough estimates given above. The value of x^/NDP obtained in the fit is 1.1 
and the spread of x^/NDP over sepai^ate experiments used in the fit is not dramatic, its biggest value 
is 1.4. We rescaled the errors in data for experiments with x^/NDP > 1 in order to bring x^/NDP 
for this experiments to 1 and found that overall impact of this rescaling on the PDFs errors is marginal. 
This proofs sufficient statistical consistency of the data sets used in the fit and disfavors huge increase 
in the value of Ax^ criterion suggested by the CTEQ collaboration for estimation of errors in the global 
fit of PDFs. A particular feature of the PDFs obtained is good stability with respect to the choice of 
factorization/renormalization scale in the DY cross section: Variation of this scale from M„+„-/2 to 
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Fig. 70: Uncertainties in the non-strange sea distributions obtained from NNLO QCD fit to the DIS data combined with the fixed 
target Drell-Yan data (solid curves). The same uncertainties obtained in fit to the DIS data only [8] are given for comparison by 
dashes. 

2M^+^- leads to variation of PDFs comparable to their uncertainties due to errors in data. 
4.6.3 LHC data and flavor separation of the sea at small x 

Combination of the existing DIS and fixed-target DY data provide good constraint on the total sea quarks 
distribution and allows separation of the u- and J-quai^k distributions up to the values of x sufficient for 
most practical applications at the LHC. At small x the total sea is also well constrained by the precise 
HERA data on the inclusive structure functions, however -u/J separation is poor in this region due to lack 
of the deuteron target data at HERA. The problem of the sea flavor separation is regulai^ly masked due 
to additional constraints imposed on PDFs. In particular, most often the Regge-like behavior of the sea 
isospin asymmetry x{d — u) oc x'^"'* is assumed with Uud selected around value of 0.5 motivated by the 
intercept of the meson trajectories. This assumption automatically provides constraint J = -u at x — > 
and therefore leads to suppression of the uncertainties both in u and d at small x. If we do not assume 
the Regge-like behavior of x{d — u) its precision determined from the NNLO fit to the combined DIS 
and DY data of Section 1.2 is about 0.04 at x = 10"^ furthermore this constraint is defined rather by 
assumption about the shape of PDFs at small x than by data used in the fit. The strange sea distribution 
is known much worse than the non-strange ones. It is essentially defined only by the CCFR experiment 
from the cross section of dimuon production in the neutrino nucleus collisions [172]. In this experiment 
the strange sea distribution was probed at x = 0.01 0.2 and the shape obtained is similar- to one of 
the non-strange sea with the strangeness suppression factor about 0.5. This is in clear disagreement with 
the Regge-lrke constraint on x{d — s) or x{u — s) and therefore we cannot use even this assumption to 
predict the strange sea at small x. 

The LHC data on ^^/i^ production cross section can be used for further validation of the sea dis- 
tributions at small x. Study of this process at the lepton pair masses down to 15 GeV will allow to probe 
PDFs at X down to 10~^, while with both leptons detected full kinematics can be reliably reconstructed. 
In order to check impact of the foreseen LHC data on the sea flavor separation we generated sample 
of pseudo-data for the process pp fi~^fi^X at = 14 TeV with integral luminosity of 10 1/fb 
corresponding to the first stage of the LHC operation. In order to meet typical limitations of the LHC 
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Fig. 71: The la error band for x{d — u) (upper panel) and x{d — s) (lower panel) expected for the fit of PDFs to the LHC data 
combined with the global DIS ones. Dashed curves correspond to the case of ^/7*-production, dots are for the combination 
Zf-y*- with the VF^/M^" -production. Solid curves are for the central values obtained from the reference fit to the global DIS 
data 



detectors only events with the lepton pair absolute rapidity less than 2.5 were accepted; other detector 
effects were not taken into account. For generation of these pseudo-data we used PDFs obtained in the 
dedicated version of fit [167] with the sea distributions parameterized as xS^^^s = 'nu,d,sX°'{^ — x)^" '* '' 
with the constraints rju = rjfi = r]s and bg = + bd)/2 imposed. These constraints are necessary for 
stability of the fit in view of limited impact of the DIS data on the flavor separation and, besides, the 
former one guarantees SU(3) symmetry in the sea distributions at small x. The generated pseudo-data 
were added to the basic DIS data sample and the errors in PDFs parameters were re-estimated with no 
constraints on the sea distributions imposed at this stage. Since dimuon data give extra information about 
the PDFs products they allow to disentangle the strange distribution, if an additional constraint on the 
non-strange sea distributions is set. The dashed curves in the lower panel of Fig.71 give the la bands for 
x{d — s) as they are defined by the LHC simulated data combined with the global DIS ones given {d — u) 
is fixed. One can see that d/s (and u/s) separation at the level of several percents would be feasible 
down to x=10~'^ in this case. The supplementary constraint on {d — u) can be obtained from study of the 
-boson chai-ge asymmetry. To estimate impact of this process we simulated the single W^- and W~- 
production data similarly to the case of the yU^'^^Li^-production and took into account this sample too. In 
this case one can achieve separation of all three flavors with the precision better than 0.01 (see Fig.71). 
Note that strange sea separation is also improved due to certain sensitivity of the VF-production cross 
section to the sti'ange sea contribution. The estimates obtained refer to the ideal case of full kinematical 
reconstruction of the VF-bosons events. For the case of using the charge asymmetry of muons produced 
from the T4^-decays the precision of the PDFs would be worse. Account of the backgrounds and the 
detector effects would also deteriorate it, however these losses can be at least partially compensated by 
rise of the LHC luminosity at the second stage of operation. 
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Valence 




0.718 ±0.085 




bu 


3.81 ±0.16 






-1.56 ±0.46 




lu 


3.30 ± 0.49 




ad 


1.71 ±0.20 




bd 


10.00 ±0.97 




ed 


-3.83 ±0.23 




Id 


4.64 ±0.41 


Sea 


As 


0.211 ±0.016 




as 


-0.048 ± 0.039 




bs 


2.20 ±0.20 


Glue 


ao 


0.356 ± 0.095 




be 


10.9 ± 1.4 




as(Mz) 


0.1132 ±0.0015 



Table 19: Values of the parameters obtained in the benchmark fit. 
4.6.4 Benchmarking of the PDFs fit 

For the available nucleon PDFs the accuracy at percent level is reached in some kinematical regions. 
For this reason benchmarking of the codes used in these PDFs fits is becoming important issue. A 
tool for calibration of the QCD evolution codes was provided by Les Houches workshop [150]. To allow 
benchmarking of the PDFs errors calculation we performed a test fit suggested in Les Houches workshop 
too. This fit reproduces basic features of the existing global fits of PDFs, but is simplified a lot to facilitate 
its reproduction. We use for the analysis data on the proton DIS structure functions F2 obtained by the 
BCDMS, NM, HI, and ZEUS collaborations and ratio of the deuteron and proton structure functions F2 
obtained by the NMC. The data tables with full description of experimental errors taken into account 
are available online^". Cuts for the momentum transferred > 9 GeV^ and for invariant mass of the 
hadronic system > 15 GeV^ are imposed in order to avoid influence of the power coiTcctions and 
simplify calculations. The contribution of the Z-boson exchange at large Q is not taken into account for 
the same purpose. The PDFs are parameterized in the form 

xpi{x,lGeV) = Nix'''{l-xf'{l + ei^/x + J^x), 

to meet choice common for many popular global fits of PDFs. Some of the parameters e-i and 7^ are set 
to zero since they were found to be consistent to zero within the errors. We assume isotopic symmetry 
for sea distribution and the strange sea is the same as the non-strange ones suppressed by factor of 
0.5. Evolution of the PDFs is performed in the NLO QCD approximation within the MS scheme. The 
heavy quarks contribution is accounted in the massless scheme with the variable number of flavors (the 
thresholds for c- and 6-quarks are 1.5 GeV and 4.5 GeV correspondingly). All experimental eiTors 
including coiTclated ones ai^e taken into account for calculation of the eiTors in PDFs using the covariance 
matrix approach [173] and assuming linear propagation of errors. The results of the benchmark fit 
obtained with the code used in analysis of Refs. [40, 167] are given in Tables 19 and 20. The total number 
of the fitted PDF parameters left is 14. The normalization parameters Ni for the gluon and valence 
quai^k distributions are calculated from the momentum and fermion number conservation. The remaining 
normalization parameter As gives the total momentum carried by the sea distributions. Important note is 
that in view of many model assumptions made in the fit these results can be used mainly for the purposes 
of benchmarking rather for the phenomenological studies. 



https://mail.ihep.rur alekhin/benchmark/TABLE 
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bu 




1.000 


0.728 


K 


0.728 


1.000 




-0.754 


-0.956 


lu 


-0.708 


-0.088 


ad 


0.763 


0.377 


bd 


0.696 


0.620 


ed 


-0.444 


-0.420 


Id 


0.215 


0.387 


As 


-0.216 


0.175 


as 


-0.473 


-0.182 


bs 


-0.686 


-0.713 


aa 


0.593 


0.067 


be 


0.777 


0.505 




-0.006 


-0.337 



Cm 


lu 


^d 


-0. /54 


-0. /Oo 


0. /63 


-0.956 


-0.088 


0.377 


1.000 


0.105 


-0.388 


0.105 


1.000 


-0.741 


-0.388 


-0.741 


1.000 


-0.662 


-0.390 


0.805 


0.503 


0.219 


-0.622 


-0.485 


0.107 


0.248 


-0.229 


0.597 


-0.367 


0.059 


0.591 


-0.509 


0.600 


0.310 


-0.528 


-0.047 


-0.716 


0.652 


-0.503 


-0.675 


0.664 


0.276 


-0.088 


0.101 



bd 


Ed 


Id 

1 (J, 


0.696 


-0.444 


0.215 


0.620 


-0.420 


0.387 


-0.662 


0.503 


-0.485 


-0.390 


0.219 


0.107 


0.805 


-0.611 


0.248 


1.000 


-0.904 


0.728 


-0.904 


1.000 


-0.896 


0.728 


-0.896 


1.000 


0.017 


-0.132 


0.346 


-0.193 


-0.019 


0.240 


-0.512 


0.245 


-0.107 


0.272 


-0.038 


-0.241 


0.576 


-0.362 


0.120 


-0.136 


0.173 


-0.228 



As 


as 


bs 


-0.216 


-0.473 


-0.686 


0.175 


-0.182 


-0.713 


-0.229 


0.059 


0.600 


0.597 


0.591 


0.310 


-0.367 


-0.509 


-0.528 


0.017 


-0.193 


-0.512 


-0.132 


-0.019 


0.245 


0.346 


0.240 


-0.107 


1.000 


0.708 


0.127 


0.708 


1.000 


0.589 


0.127 


0.589 


1.000 


-0.375 


-0.595 


-0.508 


-0.026 


-0.241 


-0.402 


0.047 


-0.011 


-0.109 





be 


as{Mz 


0.593 


0.777 


-0.006 


0.067 


0.505 


-0.337 


-0.047 


-0.503 


0.276 


-0.716 


-0.675 


-0.088 


0.652 


0.664 


0.101 


0.272 


0.576 


-0.136 


-0.038 


-0.362 


0.173 


-0.241 


0.120 


-0.228 


-0.375 


-0.026 


0.047 


-0.595 


-0.241 


-0.011 


-0.508 


-0.402 


-0.109 


1.000 


0.565 


0.587 


0.565 


1.000 


-0.138 


0.587 


-0.138 


1.000 



Table 20: Correlation coefficients for the parameters obtained in the benchmark fit. 



4.7 Benchmark Partons from DIS data and a Comparison with Global Fit Partons 

In this article 1 consider the uncertainties on partons arising from the errors on the experimental data that 
are used in aparton analysis. Various groups [174], [175], [40], [18], [41], [34], [44] have concentrated 
on the experimental eiTors and have obtained estimates of the uncertainties on parton distributions within 
a NLO QCD framework, using a variety of competing procedures. Here the two analyses, performed 
by myself and S. Alekhin (see Sec. 4.6) minimise the differences one obtains for the central values 
of the partons and the size of the uncertainties by fitting to exactly the same data sets with the same 
cuts, and using the same theoretical prescription. In order to be conservative we use only DIS data - 
BCDMS proton [163] and deuterium [164] fixed target data, NMC data on proton DIS and on the ratio 
F^{x, Q2) /F^x, [165], and HI [41] and ZEUS [53] DIS data. We also apply cuts of = QGeV^ 
and W"^ = 15GeV^ in order to avoid the influence of higher twist. We each use NLO perturbative QCD 
in the MS renormalization and factorization scheme, with the zero-mass variable flavour number scheme 
and quark masses of rric = l.SGeV and m{, = 4.5GeV. There is a very minor difference between 
asifJ-'^) used in the two fitting programs due to the different methods of implementing heavy quark 
thresholds (the differences being formally of higher order), as observed in the study by M. Whalley for 
this workshop [176]. If the couplings in the two approaches have the same value at fi^ = M|, then the 
MRST value is ~ 1% higher for ^ 20GeV^ 

We each input our parton distributions at Qq = IGeV^ with a parameterization of the form 

xMx,Ql) = Ml-xf^{l + e,x'^-'' + j,x)x''\ (4.61) 

The input sea is constrained to be 40% up and anti-up quarks, 40% down and anti-down quarks, and 
20% strange and antistrange. No difference between u and d is input. There is no negative term for the 
gluon, as introduced in [34], since this restricted form of data shows no strong requirement for it in order 
to obtain the best fit. Similai^ly we ai^e able to set eg, 7^, and 75 all equal to zero. Ag is set by the 
momentum sum rule and Auy and A^y are set by valence quark number. Hence, there are nominally 
13 free parton parameters. However, the MRST fitting program exhibited instability in the error matrix 
due to a very high correlation between uy parameters, so was set at its best fit value of e„ = —1.56, 
while 12 pai^ameters were free to vary. The coupling was also allowed to vary in order to obtain the best 
fit. The treatment of the errors on the data was exactly as for the published paitons with uncertainties for 
each group, i.e. as in [40] and [19]. This means that all detail on correlations between errors is included 
for the Alekhin fit (see Sec. 4.6), assuming that these errors are distributed in the Gaussian manner. The 
errors in the MRST fit are treated as explained in the appendix of [19], and the correlated errors are not 
allowed to move the central values of the data to as great an extent for the HERA data, and cannot do 
so at all for the fixed target data, where the data used are averaged over the different beam energies. 
The Alekhin approach is more statistically rigorous. The MRST approach is more pragmatic, reducing 
the ability of the data to move relative to the theory comparison by use of correlated eiTors (other than 
normalization), and is in some ways similar to the offset method [44]. The danger of this movement of 
data relative to theory has been suggested by the joint analysis of HI and ZEUS data at this workshop 
(see Sec. 3.3), where letting the joint data sets determine the movement due to correlated errors gives 
different results from when the data sets are compared to theoretical results. 

4. 7. 1 Comparison Between the Benchmark Parton Distributions. 

I compare the results of the two approaches to fitting the restricted data chosen for the benchmarking. 
The input parameters for the Alekhin fit ai^e presented in Sec. 4.6. Those for the MRST type fit are 
similar, but there are some differences which are best illustrated by comparing the paitons at a typical 
for the data, e.g. = 20GeV^. A comparison is shown for the dy quarks and the gluon in Fig. 72. 

From the plots it is clear that there is generally good agreement between the parton distributions. 
The central values are usually very close, and nearly always within the uncertainties. The difference 

Contributing author: R.S. Tiiorne. 
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Fig. 72: Left plot; xdv{x, 20) from the MRST benchmark partons compared to that from the Alekhin benchmark partons. 
Right plot: xg{x, 20) from the MRST benchmark partons compared to that from the Alekhin benchmark partons. 

in the central values is mainly due to the different treatment of correlated errors, and partially due to 
the difference in the coupling definition. The uncertainties are similar- in the two sets, but are generally 
about 1.2 — 1.5 times larger for the Alekhin partons, due to the increased freedom in the use of the 
correlated experimental errors. The values of as{M^) are quite different, as{M^) = 0.1132 it 0.0015 
compared to 0.1110 ± 0.0012. However, as mentioned earlier, one expects a 1% difference due to the 
different threshold prescriptions - the MRST as would be lai^ger at ~ 20GeV^, where the data are 
concentrated, so correspondingly to fit the data it receives a 1% shift downwards for = M^. Once 
this systematic effect is taken into account, the values of as{M'^) are very compatible. Hence, there is 
no surprising inconsistency between the two sets of paiton distributions. 

4. 7.2 Comparison of the Benchmark Parton Distributions and Global Fit Partons. 

It is also illuminating to show the comparison between the benchmark partons and the published partons 
from a global fit. This is done below for the MRSTOl partons. For example, uv{x, Q"^) and u{x, Q^) are 
shown in Fig. 73. It is striking that the uncertainties in the two sets are rather similar. This is despite the 
fact that the uncertainty on the benchmark partons is obtained from allowing Ax^ = 1 in the fit while 
that for the MRSTOl partons is obtained from Ax^ = 50.^^ This illustrates the great improvement in 
precision which is obtained due to the increase in data from the relaxation of the cuts and the inclusion 
of types of data other than DIS. For the uy partons, which are those most directly constrained by the 
DIS data in the benchmark fit, the comparison between the two sets of partons is reasonable, but hardly 
perfect - the central values differing by a few standard deviations. This is particularly important given 
that in this comparison the treatment of the data in the fit has been exactly the same in both cases. There 
is a minor difference in theoretical approach because of the simplistic treatment of heavy flavours in 
the benchmar^k fit. However, this would influence the gluon and sea quar^ks rather than valence quarks. 

^"Though it is meant to be interpreted as a one sigma error in the former case and a 90% confidence limit in the latter. 
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Fig. 73: Left plot: xuv{x, 20) from the MRST benchmark partons compared to that from the MRSTOl partons. Right plot: 
xu{x, 20) from the MRST benchmark partons compared to that from the MRSTOl partons with emphasis on large x. 

Moreover, the region sensitive to this simplification would be ~ (the lower charge weighting for 
bottom quarks greatly reducing the effect neai^ = iri^) which is removed by the Q"^ cut of 9GeV^. 
Indeed, introducing the variable flavour number scheme usually used for the MRST partons modifies 
the benchmark partons only very minimally. Hence, if the statistical analysis is correct, the benchmark 
partons should agree with the global partons within their uncertainties (or at most 1.5 times their un- 
certainties, allowing for the effect of the correlated eiTors), which they do not. For the u partons the 
comparison is far worse, the benchmark partons being far larger at high x. 

This disagreement in the high-x u partons can be understood better if one also looks at the high- 
X dv distribution shown in Fig. 74. Here the benchmark distribution is very much smaller than for 
MRSTOl. However, the increase in the sea distribution, which is common to protons and neutrons, at 
high-x has allowed a good fit to the high-x BCDMS deuterium data even with the very small high-x dy 
distribution. In fact it is a better fit than in [19]. However, the fit can be shown to break down with the 
additional inclusion of high-x SLAC data [162] on the deuterium structure function. More dramatically, 
the shape of the u is also completely incompatible with the Drell-Yan data usually included in the global 
fit, e.g. [170, 177]. Also in Fig. 74 we see that the dy distributions are very different at smaller x. The 
benchmark set is markedly inconsistent with NMC data on F^^x, Q^)/F2 (x, Q"^) which is at small x, 
but below the cut of = QGeV^. 

The gluon from the benchmark set is also compared to the MRSTOl gluon in Fig. 75. Again there 
is an enormous difference at high x. Nominally the benchmark gluon has little to constrain it at high x. 
However, the momentum sum rule determines it to be very small in this region in order to get the best fit 
to HERA data, similar to the gluon from [41]. As such, the gluon has a small uncertainty and is many 
standard deviations from the MRSTOl gluon. Indeed, the input gluon at high x is so small that its value 
at higher is dominated by the evolution of uy quarks to gluons, rather than by the input gluon. Hence, 
the uncertainty is dominated by the quai^k parton input uncertainty rather than its own, and since the up 
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Fig. 74: Left plot: xdv(x, 20) from the MRST benchmark partons compared to that from the MRSTOl partons. Right plot: 
xdv{x, 20) from the MRST benchmark partons compared to that from the MRSTOl partons with emphasis on small x. 

quark is well determined the uncertainty on the high- a; gluon is small for the benchmark partons. The 
smallness of the high-j; gluon results in the benchmark partons producing a very poor prediction indeed 
for the Tevatron jet data [178, 179], which are the usual data that constrain the high-x gluon in global fits. 

It is also illustrative to look at small x. Here the benchmark gluon is only a couple of standard 
deviations from the MRSTOl gluon, suggesting that its size is not completely incompatible with a good 
fit to the HERA small-x data at below the benchmark cut. However, the uncertainty in the benchmark 
gluon is much smaller than in the MRSTOl gluon, despite the much smaller amount of low-x data in the 
fit for the benchmark partons. This comes about as a result of the artificial choice made in the gluon input 
at Qq. Since it does not have the term introduced in [19], allowing the freedom for the input gluon to be 
negative at very small x, the gluon is required by the fit to be valence-like. Hence, at input it is simply 
very small at small x. At higher it becomes much larger, but in a manner driven entirely by evolution, 
i.e. it is determined by the input gluon at moderate x, which is well constrained. In this framework the 
small-x gluon does not have any intrinsic uncertainty - its uncertainty is a reflection of moderate x. This 
is a feature of e.g. the CTEQ6 gluon uncertainty [18], where the input gluon is valence-like. In this case 
the percentage gluon uncertainty does not get any larger once x reaches about 0.001. The alternative 
treatment in [19] gives the expected increase in the gluon uncertainty as x ^ 0, since in this case the 
uncertainty is determined lai^gely by that in the input gluon at small x. The valence-like input form for a 
gluon is an example of fine-tuning, the form being unstable to evolution in either direction. The artificial 
limit on the small-x uncertainty is a consequence of this. 

4.7.3 Conclusions. 

I have demonstrated that different approaches to fitting parton distributions that use exactly the same 
data and theoretical framework produce partons that are very similar and have comparable uncertain- 
ties. There are certainly some differences due to the alternative approaches to dealing with experimental 
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Fig. 75: Left plot: xg{x, 20) from the MRST benchmark partons compared to that from the MRST2001 partons. Right plot: 
xg{x, 20) from the MRST benchmark partons compared to that from the MRST2001 partons with emphasis on small x. 

errors, but these are relatively small. However, the partons extracted using a very limited data set are 
completely incompatible, even allowing for the uncertainties, with those obtained from a global fit with 
an identical treatment of errors and a minor difference in theoretical procedure. This implies that the 
inclusion of more data from a variety of different experiments moves the central values of the partons in 
a manner indicating either that the different experimental data are inconsistent with each other, or that 
the theoretical framework is inadequate for coiTcctly describing the full range of data. To a certain extent 
both explanations are probably true. Some data sets are not entirely consistent with each other (even 
if they are seemingly equally reliable). Also, there ai^e a wide variety of reasons why NLO perturba- 
tive QCD might require modification for some data sets, or in some kinematic regions [89]. Whatever 
the reason for the inconsistency between the MRST benchmark partons and the MRSTOl partons, the 
comparison exhibits the dangers in extracting partons from a very limited set of data and taking them se- 
riously. It also clearly illustrates the problems in determining the true uncertainty on parton distributions. 
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4,8 Stability of PDF fits 

One of the issues raised at the workshop is the reliability of determinations of parton distribution func- 
tions (PDFs), which might be compromised for example by the neglect of NNLO effects or non-DGLAP 
evolution in the standard analysis, or hidden assumptions made in parameterizing the PDFs at nonper- 
turbative scales. We summarize the results of the CTEQ PDF group on this issue. For the full story 
see [168]. 

4. 8. 1 Stability of PDF determinations 

The stability of NLO global analysis was seriously challenged by an analysis [89] which found a 20% 
variation in the cross section predicted for W production at the LHC - a critical "standard candle" process 
for hadron colliders - when certain cuts on input data are varied. If this instability were confirmed, it 
would significantly impact the phenomenology of a wide range of physical processes for the Tevatron 
Run 11 and the LHC. The CTEQ PDF group therefore performed an independent study of this issue within 
their global analysis framework. In addition, to explore the dependence of the results on assumptions 
about the parameterization of PDFs at the starting scale Qq = 1.3 GeV, we also studied the effect of 
allowing a negative gluon distribution at small x - a possibility that is favored by the MRST NLO 
analysis, and that is closely tied to the W cross section controversy. 

The stability of the global analysis was investigated by varying the inherent choices that must be 
made to perform the analysis. These choices include the selection of experimental data points based on 
kinematic cuts, the functional forms used to parameterize the initial nonperturbative parton distribution 
functions, and the treatment of a^. 

The stability of the results is most conveniently measured by differences in the global for the 
relevant fits. To quantitatively define a change of that characterizes a significant change in the quality 
of the PDF fit is a difficult issue in global QCD analysis. In the context of the current analysis, we have 
argued that an increase by Ax^ ~ 100 (for ^ 2000 data points) represents roughly a 90% confidence 
level uncertainty on PDFs due to the uncertainties of the current input experimental data [18, 180-182]. 
In other words, PDFs with — XecstFit > are regarded as not tolerated by current data. 

The CTEQ6 and previous CTEQ global fits imposed "standard" cuts Q > 2 GeV and W > 
3.5 GeV on the input data set, in order to suppress higher-order terms in the perturbative expansion 
and the effects of resummation and power-law ("higher twist") coiTcctions. We examined the effect of 
stronger cuts on Q to see if the fits are stable. We also examined the effect of imposing cuts on x, which 
should serve to suppress any errors due to deviations from DGLAP evolution, such as those predicted 
by BFKL. The idea is that any inconsistency in the global fit due to data points near the boundary of 
the accepted region will be revealed by an improvement in the fit to the data that remain after those 
near-boundary points have been removed. In other words, the decrease in x^ for the subset of data that is 
retained, when the PDF shape parameters are refitted to that subset alone, measures the degree to which 
the fit to that subset was distorted in the original fit by compromises imposed by the data at low x and/or 
low Q. 

The main results of this study are presented in Table 21. Three fits are shown, from three choices 
of the cuts on input data as specified in the table. They ai^e labeled 'standard', 'intermediate' and 'strong'. 
A'^pts is the number of data points that pass the cuts in each case, and xjvptB ^ value for that subset 
of data. The fact that the changes in x^ in each column are insignificant compared to the uncertainty 
tolerance is strong evidence that our NLO global fit results are very stable with respect to choices of 
kinematic cuts. 

We extended the analysis to a series of fits in which the gluon distribution g(x) is allowed to be 
negative at small x, at the scale Qo = 1-3 GeV where we begin the DGLAP evolution. The purpose of 
this additional study is to determine whether the feature of a negative gluon PDF is a key element in the 

^'Contributing authors: J. Huston, J. Pumplin. 
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Cuts 


Qmin 


2^ mill 


A^pts 


Xl926 


Xl770 


Xl588 


a^^'^^B,, [nb] 


standard 


2GeV 





1926 


2023 


1850 


1583 


20.02 


intermediate 


2.5 GeV 


0.001 


1770 




1849 


1579 


20.10 


strong 


3.162 GeV 


0.005 


1588 






1573 


20.34 



Table 21: Comparisons of three fits with different choices of the cuts on input data at the Q and x values indicated. In these 
fits, a conventional positive-definite gluon parameterization was used. 



Cuts 


Qmin 


^min 


iVpts 


Xl926 




Xl588 


a\;^'^xBeu [nb] 


standard 


2 GeV 





1926 


2011 


1845 


1579 


19.94 


intermediate 


2.5 GeV 


0.001 


1770 




1838 


1574 


19.80 


strong 


3.162 GeV 


0.005 


1588 






1570 


19.15 



Table 22: Same as Table 21 except that the gluon parameterization is extended to allow negative values. 



Stability puzzle, as suggested by the findings of [89]. The results are presented in Table 22. Even in this 
extended case, we find no evidence of instabiUty. For example, for the subset of 1588 points that pass 
the strong cuts increases only from 1570 to 1579 when the fit is extended to include the full standard data 
set. 

Comparing the elements of Table 21 and Table 22 shows that our fits with g{x) < have slightly 
smaller values ofx^'- e.g., 2011 versus 2023 for the standard cuts. However, the difference Ax^ = 12 
between these values is again not significant according to our tolerance criterion. 



4.8.2 W cross sections at the LHC 

The last columns of Tables 21 and 22 show the predicted cross section for + W~ production at 
the LHC. This prediction is also very stable: it changes by only 1.6% for the positive-definite gluon 
parameterization, which is substantially less than the overall PDF uncertainty of aw estimated previously 
with the standard cuts. For the negative gluon parameterization, the change is 4%-larger, but still less 
than the overall PDF uncertainty. These results are explicitly displayed, and compared to the MRST 
results in Fig. 76. We see that this physical prediction is indeed insensitive to the kinematic cuts used for 
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Fig. 76: Predicted total cross section of -|- W~ production at the LHC for the fits obtained in our stability study, compared 
to the NLO results of Ref. [89]. The Q-cut values associated with the CTEQ points are given in the two tables. The overall 
PDF uncertainty of the prediction is ~ 5%. 
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the fits, and to the assumption on the positive definiteness of the gluon distribution. 

We also studied the stability of the prediction for aw using the Lagrange Multiplier (LM) method 
of Refs. [180-182]. Specifically, we performed a series of fits to the global data set that are constrained to 
specific values of aw close to the best-fit prediction. The resulting variation of versus aw measures 
the uncertainty of the prediction. We repeated the constrained fits for each case of fitting choices (param- 
eterization and kinematic cuts). In this way we gain an understanding of the stability of the uncertainty, 
in addition to the stability of the central prediction. 

Figure 77 shows the results of the LM study for the three sets of kinematic cuts described in 
Table 21, all of which have a positive-definite gluon distribution. The shown along the vertical axis is 




17 18 19 20 21 22 23 

CT^(LHC) 



Fig. 77: Lagrange multiplier results for the W cross section (in nb) at the LHC using a positive-definite gluon. The three 
curves, in order of decreasing steepness, correspond to the three sets of kinematic cuts labeled standard/intermediate/strong in 
Table 21. 

normalized to its value for the best fit in each series. In all three series, depends almost quadratically 
on aw- We observe several features: 

• The location of the minimum of each curve represents the best-fit prediction for cly^^ for the 
corresponding choice of cuts. The fact that the thrxe minima are close together displays the stability 
of the predicted cross section already seen in Table 21. 

• Although more restrictive cuts make the global fit less sensitive to possible contributions from 
resummation, power-law and other nonperturbative effects, the loss of constraints caused by the 
removal of precision HERA data points at small x and low Q results directly in increased un- 
certainties on the PDF parameters and their physical predictions. This is shown in Fig. 77 by 
the increase of the width of the curves with stronger cuts. The uncertainty of the predicted aw 
increases by more than a factor of 2 in going from the standard cuts to the strong cuts. 

Figure 78 shows the results of the LM study for the three sets of kinematic cuts described in 
Table 22, all of which have a gluon distribution which is allowed to go negative. 

We observe: 

• Removing the positive definiteness condition necessarily lowers the value of x^, because more 
possibilities are opened up in the minimization procedure. But the decrease is insignificant 
compared to other sources of uncertainty. Thus, a negative gluon PDF is allowed, but not required. 

• The minima of the two curves occur at approximately the same aw- Allowing a negative gluon 
makes no significant change in the central prediction - merely a decrease of about 1 %, which is 
small compared to the overall PDF uncertainty. 



119 



1.00 

17 18 



19 20 21 22 23 

CT^(LHC) 



Fig. 78: Lagrange multiplier results for the W cross section (in nb) at the LHC using a functional form where the gluon is not 
required to be positive-definite. The three curves, in order of decreasing steepness, correspond to the three sets of kinematic 
cuts labeled standard/intermediate/strong in Table 22. 



For the standard set of cuts, allowing a negative gluon PDF would expand the uncertainty range 
only slightly. For the intermediate and strong cuts, allowing a negative gluon PDF would signifi- 
cantly expand the uncertainty range. 
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Fig. 79: Left: mi-st2002 NLO (solid) and NNLO (dotted); Right: mrst2004 NLO (solid) and NNLO (dotted); Shaded region is 
uncertainty according to the 40 eigenvector sets of CTEQ6.1. 

We examined a number of aspects of our analysis that might account for the difference in conclu- 
sions between our stability study and that of [89]. A likely candidate seems to be that in order to obtain 
stability, it is necessary to allow a rather free parametrization of the input gluon distribution. This suspi- 
cion is seconded by recent work by MRST [183], in which a different gluon parametrization appears to 
lead to a best-fit gluon distribution that is close to that of CTEQ6. In summary, we found that the NLO 
PDFs and their physical predictions at the Tevatron and LHC are quite stable with respect to variations 
of the kinematic cuts and the PDF parametrization after all. 
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4.8.3 NLOandNNLO 

In recent years, some preliminary next-to-next-leading-order (NNLO) analyses for PDFs have been car- 
ried out either for DIS alone [184], or in a global analysis context [39] - even if all the necessary hard 
cross sections, such as inclusive jet production, are not yet available at this order. Determining the parton 
distributions at NNLO is obviously desirable on theoretical grounds, and it is reasonable to plan for hav- 
ing a full set of tools for a true NNLO global analysis in place by the time LHC data taking begins. At 
the moment, however, NNLO fitting is not a matter of pressing necessity, since the difference between 
NLO and NNLO appears to be very small compared to the other uncertainties in the PDF analysis. This 
is demonstrated in Fig. 79, which shows the NLO and NNLO gluon distributions extracted by the MRST 
group. The difference between the two curves is much smaller than the other uncertainties measured by 
the 40 eigenvector uncertainty sets of CTEQ6.1, which is shown by the shaded region. The difference 
is also much smaller than the difference between CTEQ and MRST best fits. Similar conclusions [185] 
can be found using the NLO and NNLO fits by Alekhin. 
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4.9 The neural network approach to parton distributions ^ 

The requirements of precision physics at hadron colliders, as has been emphasized through this work- 
shop, have recently led to a rapid improvement in the techniques for the determination of parton distri- 
bution functions (pdfs) of the nucleon. Specifically it is now mandatory to determine accurately the un- 
certainty on these quantities, and the different collaborations performing global pdf analysis [39,40, 186] 
have performed estimations of these uncertainties using a variety of techniques. The main difficulty is 
that one is trying to determine the uncertainty on a function, that is, a probability measure in a space 
of functions, and to extract it from a finite set of experimental data, a problem which is mathematically 
ill-posed. It is also known that the standard approach to global parton fits have several shortcomings: the 
bias introduced by choosing fixed functional forms to parametrize the parton distributions (also known as 
model dependence), the problems to assess faithfully the pdf uncertainties, the combination of inconsis- 
tent experiments, and the lack of general, process-independent error propagation techniques. Although 
the problem of quantifying the uncertainties in pdfs has seen a huge progress since its paramount impor- 
tance was raised some years ago, until now no unambiguous conclusions have been obtained. 

In this contribution we present a novel strategy to address the problem of constructing unbi- 
ased parametrizations of parton distributions with a faithful estimation of their uncertainties, based on 
a combination of two techniques: Monte Carlo methods and neural networks. This strategy, introduced 
in [187, 188], has been first implemented to address the marginally simpler problem of parametrizing 
deep-inelastic structure functions F{x,Q'^), which we briefly summarize now. In a first step we con- 
struct a Monte Carlo sampling of the experimental data (generating artificial data replicas), and then we 
train neural networks to each data replica, to construct a probability measure in the space of structure 
functions V [F{x, Q^)] . The probability measure constructed in this way contains all information from 
experimental data, including correlations, with the only assumption of smoothness. Expectation val- 
ues and moments over this probability measure are then evaluated as averages over the trained network 
sample. 
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where [F] is an arbitrary function of F{x,Q'^). 

The first step is the Monte Cai^lo sampling of experimental data, generating N^^p replicas of the 
original A'dat experimental data. 
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where r are gaussian random numbers with the same correlation as the respective uncertainties, and 
^stat ^ fjsys^ cjat are the statistical, systematic and normalization errors. The number of replicas A^rcp has 
to be large enough so that the replica sample reproduces central values, errors and correlations of the 
experimental data. 

The second step consists on training a neural network^^ on each of the data replicas. Neural 
networks are specially suitable to parametrize parton distributions since they are unbiased, robust ap- 
proximants and interpolate between data points with the only assumption of smoothness. The neural 
network training consist on the minimization for each replica of the defined with the inverse of the 
experimental covariance matrix, 
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(4.64) 



Contributing authors: L. Del Debbie, S. Forte, J. I. Latorre, A. Piccione, J. Rojo 
'For a more throughly description of neural network, see [187] and references therein 
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Fig. 80: Preliminary results for the NNPDF qns fit at Qq = 2 GeV^, and the prediction for -F'2^^(a;, Q^) compared with the 
CTEQ and MRST results. 



Our minimization strategy is based on Genetic Algorithms (introduced in [189]), which are specially 
suited for finding global minima in highly nonlinear minimization problems. 

The set of trained nets, once is validated through suitable statistical estimators, becomes the 
sought-for probability measure V [F{x, Q^)] in the space of structure functions. Now observables with 
errors and correlations can be computed from averages over this probability measure, using eq. (4.62). 
For example, the average and error of a structure function F{x,Q'^) at arbitrary (x, Q^) can be computed 

as 

= ^i7(net)(fc)(^^g2)^ ^(^^g2) ^ ^(^F{x,Q^)^) - {F{x,Q^)f . (4.65) 

A more detailed account of the application of the neural network approach to structure functions can 
be found in [188], which describes the most recent NNPDF parametrization of the proton structure 
f unction^^ . 

Hence this strategy can be used also to parametrize parton distributions, provided one now takes 
into account perturbative QCD evolution. Therefore we need to define a suitable evolution formalism. 
Since complex neural networks are not allowed, we must use the convolution theorem to evolve parton 
distributions in x— space using the inverse r(x) of the Mellin space evolution factor r(A^), defined as 

q{N, Q2) = q^N, Ql)T (iV, a, (Q^) , [QI)) , (4.66) 

The only subtlety is that the x-space evolution factor r(x) is a distribution, which must therefore be 
regulated at x = 1, yielding the final evolution equation, 

q{x, Q2) = q(:r, Q^) dy r(y) + £ ^r(y) (^q Qo) " VQi^, Qo)) , (4-67) 

where in the above equation Qg) parametrized using a neural network. At higher orders in per- 
turbation theory coefficient functions C{N) are introduced through a modified evolution factor, r(A^) = 
r(A^)C(A^). We have benchmarked our evolution code with the Les Houches benchmark tables [150] at 
NNLO up to an accuracy of 10~^. The evolution factor T{x) and its integral are computed and interpo- 
lated before the neural network training in order to have a faster fitting procedure. 

**The source code, driver program and graphical web interface for our structure function fits is available at 

http : / / Sophia . ecm . ub .es/f2neural. 
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As a first application of our method, we have extracted the nonsinglet parton distribution qNs{x, Qq) = 
^ [u + u — d — {x, Qq) from the nonsinglet structure function F^^{x, Q"^) as measured by the NMC 
[165] and BCDMS [163, 164] collaborations. The preliminary results of a NLO fit with fully correlated 
uncertainties [190] can be seen in fig. 80 compared to other pdfs sets. Our preliminary results appear 
to point in the direction that the uncertainties at small x do not allow, provided the cuiTcnt experimental 
data, to determine if qNsi^^Q"^) grows at small x, as supported by different theoretical arguments as 
well as by other global parton fits. However, more work is still needed to confirm these results. Only 
additional nonsinglet structure function data at small x could settle in a definitive way this issue^^. 

Summarizing, we have described a general technique to parametrize experimental data in an bias- 
free way with a faithful estimation of their uncertainties, which has been successfully applied to structure 
functions and that now is being implemented in the context of parton distribution. The next step will be 
to construct a full set of parton distributions from all available hard-scattering data using the strategy 
described in this contribution. 



Like the experimental low x deuteron structure function which would be measured in an hypothetical electron-deuteron 
run at HERA II, as it was pointed out during the workshop by M. Klein (section 3.6) and C. Gwenlan 
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5 Resummation ' 



5.1 Introduction 

An accurate perturbative determination of the hard paitonic cross-sections (coefficient functions) and 
of the anomalous dimensions which govern parton evolution is necessary for the precise extraction of 
parton densities. Recent progress in the determination of higher order contributions to these quantities 
has been reviewed in Sec. 4.2. As is well known, such high-order perturbative calculations display 
classes of terms containing lai^ge logarithms, which ultimately signal the breakdown of perturbation 
theory. Because these terms ai^e scale-dependent and in general non universal, lack of their inclusion can 
lead to significant distortion of the parton densities in some kinematical regions, thereby leading to loss 
of accuracy if parton distributions extracted from deep-inelastic scattering (DIS) or the Drell-Yan (DY) 
processes are used at the LHC. 

Logarithimic enhancement of higher order perturbative contribution may take place when more 
than one large scale ratio is present. In DIS and DY this happen in the two opposite Umits when the 
center-of-mass energy of the partonic collision is much higher than the characteristic scale of the process, 
or close to the threshold for the production of the final state. These correspond respectively to the small 
X and lai^ge x kinematical regions, where < x < 1 is defined in terms of the invariant mass A/^ of the 
non-leptonic final state as JVP = _ xhe corresponding perturbative contributions are respectively 

enhanced by powers of In ^ and ln(l — x), or, equivalently, in the space of Mellin moments, by powers of 
and In N , where ^ moments dominate as a; ^ while A^ — > oo moments dominate as a; — > 1. 

The theoretical status of small x and large x resummation is somewhat different. Large x logs 
are well understood and the corresponding perturbative corrections have been determined to all orders 
with very high accuracy. Indeed, the coefficients that determine their resummation can be extracted 
from fixed-order perturbative computations. Their resummation for DY and DIS was originally derived 
in [191, 192] and extended on very general grounds in [193]. The coefficients of the resulting exponenti- 
ation have now been determined so that resummation can now be performed exactly at N^LL [102, 194], 
and to a very good approximation at N^LL [195-197], including even some non-logarithmic terms [198]. 
On the other hand, small x logs are due to the fact that at high energies, due to the opening of phase space, 
both collinear [24,25,27] and high-energy [28-30, 199] logarithms contribute, and thus the coefficients 
required for their resummation can only be extracted from a simultaneous resolution of the DGLAP 
equation, which resums collinear logarithms, and the BFKL equation, which resum the high-energy log- 
arithms. Although the determination of the kernels of these two equations has dramatically progressed 
in the last several years, thanks to the computation of the N^LO DGLAP kernel [102, 103] and of the 
NLO BFKL kernel [28-30, 199-201], the formalism which is needed to combine these two equations, 
as required for sucessful phenomenology, has only recently progressed to the point of being usable for 
realistic applications [169,202-210]. 

In practice, however, neither small x nor large x resummation is systematically incorporated in 
cun^ent parton fits, so data points for which such effects may be important must be discarded. This 
is especially unsatisfactory in the case of large x resummation, where resummed results (albeit with a 
varying degree of logarithmic accuracy) are available for essentially all processes of interest for a global 
parton fit, in particulars besides DIS and DY, prompt photon production [211,212], jet production [213, 
214] and heavy quark electroproduction [215,216]. Even if one were to conclude that resummation is 
not needed, either because (at small x) it is affected by theoretical uncertainties or because (at large 
x) its effects are small, this conclusion could only be arrived at after a careful study of the impact of 
resummation on the determination of parton distributions, which is not available so fai\ 

The purpose of this section is to provide a first assessment of the potential impact of the inclusion 

**Subsection coordinator: S. Forte 

^'Contributing autliors: G. Altarelli, J. Andersen, R. D. Ball, M. Ciafaloni, D. Colferai, G. Corcella, S. Forte, L. Magnea, 
A. Sabio Vera, G. P. Salam, A. Stasto 
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of small X and large x resummation on the determination of parton distributions. In the case of large 
X, this will be done by determining resummation effects on parton distributions extracted from structure 
functions within a simplified parton fit. In the case of small x, this will be done through a study of the 
impact of small x resummation on splitting functions, as well as the theoretical uncertainty involved in the 
resummation process, in particular by comparing the results obtained within the approach of ref. [202- 
204] and that of ref. [169,205-210]. We will also discuss numerical approaches to the solution of the 
small-x (BFKL) evolution equation. 

5.2 Soft gluons 

With the cuiTcnt level of theoretical control of soft gluon resummations, available calculations for DIS 
or DY should be fully reliable over most of the available phase space. Specifically, one expects cuiTcnt 
(resummed) predictions for DIS structure functions to apply so long as the leading power correction can 
be neglected, i.e. so long as ~ (1 — x)Q'^ » h?, with x = xbj- Similarly, for the inclusive 
DY cross section, one would expect the same to be true so long as (1 — z)^Q^ » A^, where now 
z = Q'^ /s, with s = X1X2S the partonic center of mass energy squared. Indeed, as already mentioned, a 
consistent inclusion of resummation effects in parton fits is feasible with present knowledge: on the one 
hand, recent fits show that consistent parton sets can be obtained by making use of data from a single 
process (DIS) (see Sec. 3.3,4.6 and Ref. [167]), on the other hand, even if one adopts the philosophy of 
global fits, resummed calculations are available for all processes of interest. 

In practice, however, currently available global parton fits are based on NLO, or N^LO fixed-order 
perturbative calculations, so data points which would lie within the expected reach of resummed calcu- 
lations cannot be fit consistently and must be discarded. The effect is that large-x quark distributions 
become less constrained, which has consequences on the gluon distribution, as well as on medium-x 
quark distributions, through sum rules and evolution. The pool of untapped information is growing, 
as more data at large values of x have become available from, say, the NuTeV collaboration at Fermi- 
lab [217,218]. A related issue is the fact that a growing number of QCD predictions for various processes 
of interest at the LHC are now computed including resummation effects in the hard partonic cross sec- 
tions, which must be convoluted with parton densities in order to make predictions at hadron level. Such 
predictions are not fully consistent, since higher order effects are taken into account at parton level, but 
disregarded in defining the parton content of the colliding hadrons. 

It is therefore worthwile to provide an assessment of the potential impact of resummation on 
parton distributions. Here, we will do this by computing resummation effects on quark distributions in 
the context of a simplified parton fit. 

5.2. 1 General Formalism in DIS 

Deep Inelastic Scattering structure functions Fi{x, Q'^) are given by the convolution of perturbative co- 
efficient functions, typically given in the MS factorization scheme, and parton densities. The coefficient 
functions Cf for quark-initiated DIS present terms that become lai^ge when the Bjorken variable x for 
the partonic process is close to 2; = 1, which forces gluon radiation from the incoming quark to be soft 
or collinear. At 0{as), for example, the coefficient functions can be written in the form 

Cf (x, ^, a.(/z2)) = 5(1 -x) + (x, ^) + O • (5.68) 

Treating all quarks as massless, the part of which contains terms that are logarithmically enhanced 
as x ^ 1 reads 



ln(l — x) 
1 - X 



+ 



(1-x). 



(5.69) 
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In moment space, where soft resummation is naturally performed, the contributions proportional to 
as[ln(l — x)/{l — and to as[l/(l — correspond to double [aalv? N) and single (oslnA^) 
logarithms of the Mellin variable N. The Mellin transform of Eq. (5.69) in fact reads, at large N, 



frq 
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2Cf <! - In^ at + 



IE + 



IniV 



(5.70) 



All terms growing logarithmically with N , as well as all A^-independent terms corresponding to contri- 
butions proportional to (5(1 — x) in x-space, have been shown to exponentiate. In particular, the pattern 
of exponentiation of logarithmic singularities is nontrivial: one finds that the coefficient functions can be 
written as 

where 'R-{N, Q^/z^F) Q^s(Ai^)) is a finite remainder, nonsingular as ^ oo, while [193] 
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(5.72) 

In Eq. (5.72) the leading logarithms (LL), of the form a" In""*"^ N , are generated at each order by the 
function A. Next-to-leading logarithms (NLL), on the other hand, of the form a" hi" N , require the 
knowledge of the function B. In general, resumming N'^LL to all orders requires the knowledge of the 
function ^4 to A; + 1 loops, and of the function B to k loops. In the following, we will adopt the common 
standard of NLL resummation, therefore we need the expansions 
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(5.73) 



to second order for A and to first order for B. The relevant coefficients are 
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Notice that in Eq. (5.72) the term ~ A{as{k'^))/k'^ resums the contributions of gluons that are both soft 
and collinear, and in fact the anomalous dimension A can be extracted order by order from the residue 
of the singularity of the nonsinglet splitting function as x ^ 1. The function B, on the other hand, is 
related to collinear emission from the final state current jet. 

In [215, 216] soft resummation was extended to the case of heavy quark production in DIS. In 
the case of heavy quarks, the function B{as) needs to be replaced by a different function, called S{as) 
in [216], which is characteristic of processes with massive quarks, and includes effects of large-angle 
soft radiation. In the following, we shall consider values of much larger than the quark masses and 
employ the resummation results in the massless approximation, as given in Eq. (5.72). 



5. 2. 2 Simplified parton fit 

We would like to use large-x resummation in the DIS coefficient functions to extract resummed parton 
densities from DIS structure function data. Large-x data typically come from fixed-target experiments: in 
the following, we shall consider recent charged-current (CC) data from neutrino-iron scattering, collected 
by the NuTeV collaboration [217,218], and neutral-current (NC) data from the NMC [165] and BCDMS 
[163, 164] collaborations. 
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Fig. 81: NuTeV data on the structure function xF^, at = 12.59 GeV^ (a) and at = 31.62 GeV^ (b), along with the 
best fit curve parametrized by Eq. (5.76). 

Our strategy will be to make use of data at different, fixed values of Q^. We will extract from 
these data moments of the corresponding structure functions, with errors; since such moments factor into 
a product of moments of parton densities times moments of coefficient functions, computing parton mo- 
ments with errors is straightforward. We then compare NLO to resummed partons in Mellin space, and 
subsequently provide a translation back to x-space by means of simple parametrization. Clearly, given 
the limited data set we are working with, our results will be affected by compai^atively large errors, and 
we will have to make simplifying assumptions in order to isolate specific quark densities. Resummation 
effects are, however, clearly visible, and we believe that our fit provides a rough quantitative estimate of 
their size. A more precise quantitative analysis would have to be performed in the context of a global fit. 

The first step is to construct a pai^ametrization of the chosen data. An efficient and faithful 
parametrization of the NMC and BCDMS neutral-current structure functions was provided in [187, 188], 
where a large sample of Monte Carlo copies of the original data was generated, taking properly into 
account errors and correlations, and a neural network was trained on each copy of the data. One can then 
use the ensemble of networks as a faithful and unbiased representation of the probability distribution in 
the space of structure functions. We shall make use of the nonsinglet structure function F2^{x, Q^) ex- 
tracted from these data, as it is unaffected by gluon contributions, and provides a combination of up and 
down quark densities which is independent of the ones we extract from charged current data (specifically, 
F^''{x,Q^) gives u-d). 

As far as the NuTeV data are concerned, we shall consider the data on the CC structure functions 
F2 and F3. The structure function can be written as a convolution of the coefficient function with 
quark and antiquark distributions, with no gluon contribution, as 



We consider data for F3 at = 12.59 and 31.62 GeV^, and, in order to compute moments, we fit them 
using the functional form 



The best-fit values of C, p and 6, along with the per degree of freedom, are given in [219]. Here we 
show the relevant NuTeV data on xF^, along with our best-fit curves, in Fig. 81. 

The analysis of NuTeV data on F2 is slightly complicated by the fact that gluon-initiated DIS 
gives a contribution, which, however, is not enhanced but suppressed at large x. We proceed therefore 



xFs = - {xFI^ + xFl) =x IV P (9 - ^) ® CI 



(5.75) 



xFsix) = Cx~P{l - xf{l + kx) . 



(5.76) 
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Fig. 82: NuTeV data on the quark-initiated contribution to the structure function F2, for = 12.59 GeV'^ (a), and 
= 31.62 GeV^ (b). The solid lines are the best-fit predictions. 

by taking the gluon density from a global fit, such as the NLO set CTEQ6M [18], and subtract from F2 
the gluon contribution point by point. We then write F2 as 

F2^^ {F^ + Fi) = X I VI' [{q + q)(^Cl + g® C|] ^ F| + F| , (5.77) 

and fit only the quark-initiated part F|, using the same parametrization as in Eq. (5.76). Fig. 82 shows 
the data on F| and the best fit curves, as determined in Ref. [219]. After the subtraction of the gluon 
contribution from F2, the structure functions we ai^e considering (F^, xFa and F2^) we. all given in 
factorized form as 

F,(x,Q2) = ^^'|^^(^,^^)C7/(|,^,a.(/.2)) , (5.78) 

where Cf is the relevant coefficient function and qi is a combination of quark and antiquark distributions 
only. Hereafter, we shall take fi = = Q for the factorization and renormalization scales. At this 
point, to identify individual quark distributions from this limited set of data, we need to make some 
simplifying assumptions. Following [219], we assume isospin symmetry of the sea, u = d, s = s and we 
further impose a simple proportionality relation expressing the antistrange density in terms of the other 
antiquarks, s = nu. As in [219], we shall present results for k = ^. With these assumptions, we can 
explicit solve for the remaining three independent quark densities (up, down, and, say, strange), using 
the three data sets we are considering. 

Taking the Mellin moments of Eq. (5.78), the convolution becomes an ordinary product and we can 
extract NLO or NLL-resummed parton densities, according to whether we use NLO or NLL coefficient 
functions. More precisely. 

After extracting the combinations q-i, one can derive the individual quark densities, at NLO and including 
NLL large-x resummation. We concentrate our analysis on the up quark distribution, since experimental 
errors on the structure functions are too large to see an effect of the resummation on the other quai'k 
densities, such as d or s, with the limited data set we are using. 

5.2.3 Impact of the resummation 

We present results for moments of the up quark distribution in Figs. 83 and 84. Resummation effects 
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Fig. 83: NLO and resummed moments of the up quark distribution at = 12.59 GeV^ 
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Fig. 84: As in Fig. 83, but at = 31.62 GeV^ 



become statistically significant around 6 — 7 at both values of Q^. Notice that high moments of the 
resummed up density are suppressed with respect to the NLO density, as a consequence of the fact that 
resummation in the MS scheme enhances high moments of the coefficient functions. 

In order to illustrate the effect in the more conventional setting of j;-space distributions, we fit our 
results for the moments to a simple parametrization of the form u{x) = Dx~'^{l — x)^ . Our best fit 
values for the parameters, with statistical eiTors, are given in Table (23), and the resulting distributions 
are displayed in Fig. 85, with one standard deviation uncertainty bands. Once again, the effect of soft 
resummation is clearly visible at large x: it suppresses the quark densities extracted from the given 
structure function data with respect to the NLO prediction. 

In order to present the effect more clearly, we show in Fig. 86 the normalized deviation of the 
NLL-resummed prediction from the NLO one, i.e. Au{x) = (u^^Loi^) ~ ^rcs(2;)) /uNLoix), at the 
two chosen values of and for the central values of the best-fit parameters. We note a change in the 
sign of An in the neighborhhod of the point x = 1/2: although our eiTors are too large for the effect 
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3.025 ±0.534 
4.647 ±0.881 
2.865 ± 0.420 
3.794 ±0.583 



7 
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0.247 ±0.109 
0.463 ± 0.086 
0.351 ±0.090 



3.162 ±0.116 
3.614 ±0.128 
3.301 ± 0.098 
3.598 ±0.104 





Fig. 86: Central value for the relative change in the up quark distribution, Ait(x) = (ttNLo(a^) — Uvas{x)) /uNLo(a;), at 
Q2 = 12.59 (a) and 31.62 GeV^ (b). 



to be statistically significant, it is natural tiiat tiie suppression of the quark distribution at large x be 
compensated by an enhancement at smaller x. In fact, the first moment of the coefficient function is 
unaffected by the resummation: thus C^, being larger at large x, must become smaller at small x. The 
further sign change at x ~ 0.1, on the other hand, should not be taken too seriously, since our sample 
includes essentially no data at smaller x, and of course we are using an x-space parametrization of limited 
flexibility. 

Finally, we wish to verify that the up-quaik distributions extracted by our fits at = 12.59 and 
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31.62 GeV^ are consistent with perturbative evolution. To achieve this goal, we evolve our A^-space 
results at = 31.62 GeV^ down to 12.59 GeV^, using NLO Altarelli-Parisi anomalous dimensions, 
and compare the evolved moments with the direct fit at 12.59 GeV^. Figures 87 and 88 show that the 
results of our fits at 12.59 GeV^ are compatible with the NLO evolution within the confidence level of 
one standard deviation. Note however that the evolution of resummed moments appears to give less 
consistent results, albeit within eiTor bands: this can probably be ascribed to a contamination between 
pertubative resummation and power corrections, which we have not disentangled in our analysis. 
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Fig. 87: Comparison of fitted moments of the NLO up quark distribution, at — 12.59 GeV^, with moments obtained via 
NLO evolution from = 31.62 GeV^ 
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Fig. 88: As in Fig. 87, but comparing NLL-resummed moments of the up quark density. 



Qualitatively, the observed effect on the up quark distribution is easily described, at least within the 
limits of a simple parametrization like the one we are employing: resummation increases the exponent 
6, responsible for the power-law decay of the distribution at large x, by about 10% to 15% at moderate 
Q^. The exponent 7, governing the small-x behavior, and the normalization D, are then tuned so that the 
first finite moment (the momentum sum rule) may remain essentially unaffected. 
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In conclusion, our results indicate that quark distributions are suppressed at large x by soft gluon 
effects. Quantitatively, we observe an effect ranging between 10% and 20% when 0.6 < x < 0.8 at 
moderate Q^, where we expect power corrections not to play a significant role. Clearly, a more detailed 
quantitative understanding of the effect can be achieved only in the context of a broader and fully con- 
sistent fit. We would like however to notice two things: first, the effect of resummations propagates 
to smaller values of x, through the fact that the momentum sum rule is essentially unaffected by the 
resummation; similarly, evolution to larger values of will shift the Sudakov suppression to smaller 
X. A second point is that, in a fully consistent treatment of hadronic cross section, there might be a 
partial compensation between the typical Sudakov enhancement of the partonic process and the Sudakov 
suppression of the quark distribution: the compensation would, however, be channel-dependent, since 
gluon-initiated partonic processes would be unaffected. We believe it would be interesting, and phe- 
nomenologically relevant, to investigate these issues in the context of a more comprehensive parton fit. 

5.3 Small x 

Small x structure functions are dominated by the flavour singlet contribution, whose coefficient functions 
and anomalous dimensions receive logarithmic enhancements, which make perturbation theory converge 
more slowly. In the small x, i.e. high energy limit, the cross section is quasi-constant and characterised by 
the effective expansion parameter (as(fc^)) log ^ log ^f^, where x = Q'^ / s, < is the transverse 

min 

momentum of the exchanged gluon, s is the photon-proton centre of mass energy squared and is 
the hard scale. Such expansion parameter can be large, due to both the double-logs and to the fact that 
(fc^) may drift towards the non-perturbative region. Even assuming that truly non-perturbative effects 
are factored out — as is the case for structure functions — the problem remains of resumming the 
perturbative series with both kinds of logarithms [24,25,27-30, 199] 

In the BFKL approach one tries to resum the high-energy logarithms first, by an evolution equation 
in log 1/x, whose fe-dependent evolution kernel is calculated perturbatively in Ug. However, the leading 
kernel [28-30, 199] overestimates the hard cross-section, and subleading ones [108, 200, 201] turn out 
to be large and of alternating sign, pointing towards an instability of the leading-log x (Lx) hierarchy. 
The problem is that, for any given value of the hard scales Q, Qq <^ ^/s — think, for definiteness, of 
7*(Q)"7*(Qo) collisions — , the contributing kernels contain collinear enhancements in all fc-orderings 
of the exchanged gluons of type y/s » • • • fci ^ A;2 • • • , or y/s S> • ■ •k2 ^ fci • • • and so on, to all 
orders in a^. Such enhancements are only partly taken into account by any given truncation of the Lj; 
hierarchy, and they make it unstable. In the DGLAP evolution equation one resums collinear logarithms 
first, but fixed order splitting functions do contain [102, 103] high-energy logarithms also, and a further 
resummation is needed. 

Two approaches to the simultaneous resummation of these two classes of logs have recently 
reached the stage where their phenomenological application can be envisaged. The renormalisation 
group improved (CCSS) approach [202-204,220] is built up within the BFKL framework, by improving 
the whole hierarchy of subleading kernels in the collinear region, so as to take into account all the k- 
orderings mentioned before, consistently with the RG. In the duality (ABF) approach [169,205-210,221] 
one concentrates on the problem of obtaining an improved anomalous dimension (splitting function) for 
DIS which reduces to the ordinary perturbative result at large N (large x), thereby automatically satisfy- 
ing renormalization group constraints, while including resummed BFKL corrections at small N (small 
x), determined through the renormalization-group improved (i.e. running coupling) version of the BFKL 
kernel. 

We will briefly review the theoretical underpinnings of these two approaches in turn, and then 
compare phenomenological results obtained in both approaches. Note that we shall use the notation of 
the CCSS or ABF papers in the con^esponding sections, in order to enable a simpler connection with the 
original literature, at the price of some notational discontinuity. In particular. In ^ is called Y by CCSS 
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and ^ by ABF; the Mellin variable conjugate to In ^ is called uo by CCSS and N by ABF; and the Mellin 
variable conjugated to In is called 7 by CCSS and M by ABF. 

5.3.1 The renormalisation group improved approach 

The basic problem which is tackled in the CCSS approach [202-204, 220] is the calculation of the (az- 
imuthally averaged) gluon Green function Q{Y; k, ko) as a function of the magnitudes of the external 
gluon transverse momenta k = |fc|, fep = l^ol and of the rapidity Y = log This is not yet a hard 
cross section, because one needs to incorporate the impact factors of the probes [222-229]. Neverthe- 
less, the Green function exhibits most of the physical features of the hard process, if we think of A;^ , k^ 
as external (hard) scales. The limits k"^ ^ k^ (feg ^ k"^) correspond conventionally to the ordered 
(anti-ordered) coUinear limit. By definition, in the w-space conjugate to Y (so that cj = By) one sets 



g^(fc,fco) = [u;-/C^]-i(fc,fco), (5.80) 
ujg^{k,kQ) = 5^{k-ko) + J d^k' ICUk,k')g^{k',ko) , (5.81) 

where ICoj{k, k') is a kernel to be defined, whose = limit is related to the BFKL F-evolution kernel 
discussed before. 

In order to understand the RG constraints, it is useful to switch from fc-space to 7-space, where the 
variable 7 is conjugated to t = log fc^/feg at fixed Y, and to make the following kinematical remark: the 
ordered (anti-ordered) region builds up scaling violations in the Bjorken variable x = k^ / s {xq = k^/ s) 
and, if x {xq) is fixed instead of kko/s = , the variable conjugated to t is shifted [230] by an in- 
dependent amount, and becomes 7 + ^ ~ ^logfc^ (1 — 7 + f ~ ^logfcg)- Therefore, the characteristic 
function Xuj (7) of K^uj (with a factor factored out) must be singular" when either one of the variables 
is small, as shown (in the frozen as limit) by 



UJ 



1 1 



7+T 1-7+ 



7w ("s,w) + 



(5.82) 



where 7^^^ is the one-loop gluon anomalous dimension, and further orders may be added. Eq. (5.82) 
ensures the correct DGLAP evolution in either one of the coUinear limits (because, e.g., 7 + ^ ~ d-^^^ ^2) 
and is oj-dependent, because of the shifts. Since higher powers of uj are related to higher subleading pow- 
ers of as [231], this w-dependence of the constraint (5.82) means that the whole hierarchy of subleading 
kernels is affected. 

To sum up, the kernel /C^^ is constructed so as to satisfy the RG constraint (5.82) and to reduce to 
the exact Lx + NLx BFKL kernels in the a; — > limit; it is otherwise interpolated on the basis of various 
criteria (e.g., momentum conservation), which involve a "scheme" choice. 

The resulting integral equation has been solved in [202-204] by numerical matrix evolution meth- 
ods in k- and x-space. Furthermore, introducing the integrated gluon density g, the resummed splitting 
function Pcs{x, Q^) is defined by the evolution equation 



51ogQ2 



I ^ P^^{z,as{Q'))g[^,Q') , (5.83) 



and has been extracted [202-204] by a numerical deconvolution method [232]. Note that in the RGl 
approach the running of the coupling is treated by adopting in (5.81) the off-shell dependence of ag 
suggested by the BFKL and DGLAP limits, and then solving the ensuing integral equation numerically. 

It should be noted that the RGl approach has the somewhat wider goal of calculating the off-shell 
gluon density (5.80), not only its splitting function. Therefore, a comparison with the ABF approach, to 
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be discussed below, is possible in the "on-shell" limit, in which the homogeneous (eigenvalue) equation 
of RGI holds. In the frozen coupling limit we have simply 

xUois,7 - f ) = w , ixuj is at scale kko) . (5.84) 

Solving Eq. (5.84) for either uj or 7, we ai^e able to identify the effective characteristic function and its 
dual anomalous dimension 

= Xcfr(as, 7) ; 7 = 7cfr(as, , (5.85) 
in the same spirit as the ABF approach [169,205-210,221]. 

5.3.2 The duality approach 

As already mentioned, in the ABF approach one constructs an improved anomalous dimension (splitting 
function) for DIS which reduces to the ordinary perturbative result at large N (large x) given by: 

j{N, as) = a,7o(iV) + ahi{N) + aij2{N) .... (5.86) 

while including resummed BFKL corrections at small A'^ (small x) which are determined by the afore- 
mentioned BFKL kernel x(^^) ots)'- 

x{M,as) = asXo{M) + alxi{M) + (5.87) 

which is the Mellin transform of the u ^ 0, angular averaged kernel /C eq. 5.81 with respect to t = In 
The main theoretical tool which enables this construction is the duality relation between the kernels x 
and 7 

x{l{N,as),as) = N, (5.88) 

[compare eq. (5.85)] which is a consequence of the fact that the solutions of the BFKL and DGLAP 
equations coincide at leading twist [205,221,233]. Further improvements are obtained exploiting the 
symmetry under gluon interchange of the BFKL gluon-gluon kernel and through the inclusion of running 
coupling effecs. 

By using duality, one can construct a more balanced expansion for both 7 and x> the "double 
leading" (DL) expansion, where the information from x is used to include in 7 all powers of a^/N and, 
conversely 7 is used to improve x by all powers of ag/M. A great advantage of the DL expansion is 
that it resums the coUinear poles of x at M = 0, enabling the imposition of the physical requirement of 
momentum conservation 7(1, Os) = 0, whence, by duality: 

X(0,a.) = l. (5.89) 

This procedure eliminates in a model independent way the alternating sign poles +1/M, — 1/M^, 

that appear in xo, Xi^ These poles make the perturbative expansion of x unreliable even in the central 

region of M: e.g., OsXo has a minimum at M = 1/2, while, at realistic values of as, OsXo + Oi^Xi has 
a maximum. 

At this stage, while the poles at M = are eliminated, those at M = 1 remain, so that the DL 
expansion is still not finite near M = 1. The resummation of the M = I poles can be accomplished by 
exploiting the collinear-anticoUinear symmetry, as suggested in the CCSS approach discussed above. In 
Mellin space, this symmetry implies that at the fixed-coupling level the kernel x for evolution in In ^ 
must satisfy x(M) = x(l ~ M). This symmetry is however broken by the DIS choice of variables 
In i = In tIs- and by the running of the coupling. In the fixed coupling limit the kernel xdis> dual to the 
DIS anomalous dimension, is related to the symmetric one Xa through the implicit equation [108] 

XDis(M + l/2x.(M)) = Xa(M), (5.90) 
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to be compared to eq. (5.84) of the CCSS approach. 

Hence, the M = I poles can be resummed by performing the double-leading resummation of 
M = poles of xdis> determining the associated Xa through eq. (5.90), then symmetrizing it, and finally 
going back to DIS variables by using eq. (5.90) again in reverse. Using the momentum conservation eq. 
(5.89) and eq. (5.90), it is easy to show that Xa{M) is an entire function of M, with X(t(— 1/2) = 
Xct(3/2) = 1 and has a minimum at M = 1/2. Through this procedure one obtains order by order from 
the DL expansion a symmetrized DL kernel xdis> and its corresponding dual anomalous dimension 
7. The kernel xdis has to all orders a minimum and satisfies a momentum conservation constraint 
XDis(O) = XDis(2) = 1. 

The final ingredient of the ABF approach is a treatment of the running coupling corrections to 
the resummed terms. Indeed, their inclusion in the resummed anomalous dimension greatly softens the 
asymptotic behavior near x = 0. Hence, the dramatic rise of structure functions at small x, which char- 
acterized resummations based on leading-order BFKL evolution, and is ruled out phenomenologically, 
is replaced by a much milder rise. This requires a running coupling generalization of the duality equa- 
tion (5.88), which is possible noting that in M space the running coupling as{t) becomes a differential 
operator, since t — > d/dM. Hence, the BFKL evolution equation for double moments G{N, M), which 
is an algebraic equation at fixed coupling, becomes a differential equation in M for running coupling. 
In the ABF approach, one solves this differential equation analytically when the kernel is replaced by its 
quadratic approximation near the minimum. The solution is expressed in terms of an Airy function if 
the kernel is linear in a^, for example in the case of a^xo^ or of a Bateman function in the more general 
case of a non linear dependence on Ug as is the case for the DL kernels. The final result for the improved 
anomalous dimension is given in terms of the DL expansion plus the "Airy" or "Bateman" anomalous 
dimension, with the terms already included in the DL expansion subtracted away. 

For example, at leading DL order, i.e. only using 7o(A^) and xo(-^^)> the improved anomalous 
dimension is 



1 



^f\as,N)= a,7o(iV) + ahi(A^)+7.(if)-^ + 7a(co, a„ iV) - ^ + ./ ^[iV - a,co]. 
L iV TTiV J 2 V i^qOLs 

(5.91) 

The terms within square brackets give the LO DL approximation, i.e. they contain the fixed-coupling 
information from 70 and (through 7^) from xo- The "Airy" anomalous dimension 7yi(co, N) contains 
the running coupling resummation, i.e. it is the exact solution of the running coupling BFKL equation 
which corresponds to a quadratic approximation to xo near M = 1/2. The last two terms subtract the 
contributions to ^A{co,as,N) which are already included in 7^ and 70. In the limit ^ with N 
fixed, ^i{as,N) reduces to as^o{N) + O(a^). For ^ with Og/N fixed, 7/(0^,-^) reduces to 
Isijf) + 0{a1/N), i.e. the leading term of the small x expansion. Thus the Airy term is subleading 
in both limits. However, if — > at fixed ag, the Airy term replaces the leading singularity of the DL 
anomalous dimension, which is a square root branch cut, with a simple pole, located on the real axis at 
rather smaller N, thereby softening the small x behaviour. The quadratic approximation is sufficient to 
give the correct asymptotic behaviour up to terms which are of subleading order in comparison to those 
included in the DL expression in eq. (5.91). 

The running coupling resummation procedure can be applied to a symmetrized kernel, which 
possesses a minimum to all orders, and then extended to next-to-leading order [209,210]. This entails 
various technical complications, specifically related to the nonlinear dependence of the symmetrized 
kernel on a^, to the need to include interference between running coupling effects and the small x 
resummation, and to the consistent treatment of next-to-leading log terms, in particular those related 
to the running of the coupling. It should be noted that even though the ABF appraoch is limited to the 
description of leading-twist evolution at zero-momentum transfer, it leads to a pair of systematic dual 
perturbative expansions for the x and 7 kernels. Hence, comparison with the CCSS approach is possible 
for instance by comparing the NLO ABF kernel to the RG improved Lx-i-NLx CCSS kernel. 
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Fig. 89: The kernel x (BFKL characteristic function) for fixed couphng (/3o = 0) Qs = 0.2 and Uf = 0. The BFKL curves are 
the LO and NLO truncations of eq. (5.87), the DGLAP curve is the dual eq. (5.88) of the NLO anomalous dimension eq. (5.86), 
while the CCSS and ABF curves are respectively the solution u of eq. (5.84) and the solution Xdis of eq. (5.90). 

5.3.3 Comparison of results 

Even though the basic underlying physical principles of the CCSS and ABF approaches are close, there 
are technical differences in the construction of the resummed RG-improved (CCSS) or symmetrized DL 
(ABF) kernel, in the derivation from it of an anomalous dimension and associated splitting function, 
and in the inclusion of running coupling effects. Therefore, we will compare results for the resummed 
fixed-coupling x kernel (BFKL characteristic function), then the corresponding fixed-coupling splitting 
functions, and finally the running coupling splitting functions which provide the final result in both 
approaches. In order to assess the phenomenological impact on parton evolution we will finally compare 
the convolution of the splitting function with a "typical" gluon distribution. 

In fig. 89 we compare the solution, uj, to the on-shell constraint, eq. (5.84) for the RGI CCSS 
result, and the solution xdis of eq- (5.90) for the symmetrized NLO DL ABF result. The pure Lx and 
NLx (BFKL) and next-to-leading In (DGLAP) are also shown. All curves are determined with frozen 
coupling (/3o = 0), and with nj = 0, in order to avoid complications related to the diagonalization of the 
DGLAP anomalous dimension matrix and to the choice of scheme for the quark parton distribution. The 
resummed CCSS and ABF results are very close, in that they coincide by construction at the momentum 
conservation points M = ^ and M = 2, and differ only in the treatment of NLO DGLAP terms. In 
comparison to DGLAP, the resummed kernels have a minimum, related to the fact that both collinear and 
anticoUinear logs are resummed. In comparison to BFKL, which has a minimum at LO but not NLO, the 
resummed kernels always have a perturbatively stable minimum, characterized by a lower intercept than 
leading-order BFKL: specifically, when = 0.2, A ~ 0.3 instead of A ~ 0.5. This corresponds to a 
softer small x rise of the associated splitting function. 

The fixed-coupling resummed splitting functions up to NLO are shown in figure 90, along with the 
unresummed DGLAP splitting functions up to NNLO.^° In the CCSS approach the splitting function is 

^"starting from NLO one needs also to specify a factorisation scheme. Small-a; results are most straightforwardly obtained 
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Fig. 90: The fixed coupling (/3o ~ 0) xPgg{x) splitting function, evaluated with as — 0.2 and n/ = 0. The dashed curves 
are LO for DGLAP, NLa;+LO for CCSS and symmetrized LO DL for ABF, while the solid curves are NLO and NNLO for 
DGLAP, NLx+NLO for CCSS and symmetrized NLO DL for ABF. 

determined by explicitly solving eq. (5.81) with the kernel corresponding to figure 89, and then applying 
the numerical deconvolution procedure of [232]. For nj = the NLO DGLAP spUtting function has 
the property that it vanishes at small x — this makes it relatively straightforward to combine not just LO 
DGLAP but also NLO DGLAP with the NLLx resummation. Both the CCSS NLx+LO and NLx+NLO 
curves are shown in figure 90. On the other hand, in the ABF approach the splitting function is the inverse 
Mellin transform of the anomalous dimension obtained using duality eq. (5.88) from the symmetrized 
DL X kernel. Hence, the LO and NLO resummed result respectively reproduce all information contained 
in the LO and NLO x ^rid 7 kernel with the additional constraint of collinear-anticoUinear symmetry. 
Both the ABF LO and NLO results are shown in figure 90. 

In comparison to unresummed results, the resummed splitting functions display the characteristic 
rise at small x of fixed-coupling leading-order BFKL resummation, though the small x rise is rather 
milder (~ x~^'^ instead of ~ x~^'^ for Us = 0.2). At large x there is good agreement between the 
resummed results and the corresponding LO (dashed) or NLO (solid) DGLAP curves. At small x the 
difference between the ABF LO and CCSS NLx-i-LO (dashed) curves is mostly due to the inclusion 
in CCSS of BFKL NLx terms, as well as to differences in the symmetrization procedure. When com- 
paring CCSS NLx-i-NLO with ABF NLO this difference is reduced, and , being only due the way the 
symmetrization is implemented, it might be taken as an estimate of the intrinsic ambiguity of the fixed- 
coupling resummation procedure. At intermediate x the NLO resummed splitting functions is of a similar 
order of magnitude as the NLO DGLAP result even down to quite small x, but with a somewhat differ- 
ent shape, characterized by a shallow dip at a; 10~^, until the small x rise sets in for x ~ 10""^. It 
has been suggested [235] that in the small Os limit this dip can be explained as a consequence of the 

in the Qo scheme, while fixed-order splitting functions are quoted in the MS scheme (for discussions of the relations between 
different schemes see [107,206,220,234]). 
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Fig. 91: The running coupling xPgg{x) splitting function, evaluated with Ua = 0.2 and n/ = 0. The various curves correspond 
to the same cases as in figure 90. 



inteiplay between the — Og^ In x NNLO term of xPgg (also present in the resummation) and the first pos- 
itive resummation effects which start with an In^ l/x term. The unstable small x drop of the NNLO 
DGLAP result appears to be a consequence of the unresummed double pole in the NNLO anomalous 
dimension. 

The running-coupling resummed splitting functions are displayed in figure 91. Note that the unre- 
summed curves are the same as in the fixed coupling case since their dependence on as is just through a 
pref actor of a^, whereas in the resummed case there is an interplay between the running of the coupling 
and the structure of the small-x logs. All the resummed curves display a considerable softening of the 
small X behaviour in comparison to their fixed-coupling counterparts, due to the softening of the leading 
small X singularity in the running-coupling case [202,207]. As a consequence, the various resummed 
results are closer to each other than in the fixed-coupling case, and also closer to the unresummed LO and 
NLO DGLAP results. The resummed perturbative expansion appears to be stable, subject to moderate 
theoretical ambiguity, and qualitatively close to NLO DGLAP. 

Finally, to appreciate the impact of resummation it is useful to investigate not only the properties 
of the splitting function, but also its convolution with a physically reasonable gluon distribution. We take 
the following toy gluon 

xg{x) = x-°-^^{l-xf , (5.92) 

and show in fig. 92 the result of its convolution with various splitting functions of fig. 91. The differ- 
ences between resummed and unresummed results, and between the CCSS and ABF resummations are 
partly washed out by the convolution, even though the difference between the unresummed LO and NLO 
DGLAP results is clearly visible. In particular-, differences between the fixed-order and resummed con- 
volution start to become significant only for x < 10"^ — 10"^, even though resummation effects started 
to be visible in the splitting functions at somewhat larger x. 
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Fig. 92: Convolution of resummed and fixed-order Pgg splitting functions with a toy gluon distribution, eq. (5.92), normalised to 
the gluon distribution itself, with Us — 0.2 and n/ = 0. The resummed CCSS and ABF curves are obtained using respectively 
the CCSS NLx+NLO and the ABF NLO splitting function shown in fig. 91. 

It should be kept in mind that it is only the gg entry of the singlet splitting function matrix that has 
so far been investigated at this level of detail and that the other entries may yet reserve surprises. 

5.3.4 Explicit solution of the BFKL equation by Regge exponentiation 

The CCSS approach of section 5.3.1 exploits a numerical solution of the BFKL equation in which the 
gluon Green's function is represented on a grid in x and k. This method provides an efficient determi- 
nation of the azimuthally averaged Green's function and splitting functions — for percent accuracy, up 
to y = 30, it runs in a few seconds — for a wide range of physics choices, e.g. pure NLx, various 
NLx-i-NLO schemes. Here we will discuss an alternative framework suitable to solve numerically the 
NLL BFKL integral equation [236], based on Monte Carlo generation of events, which can also be ap- 
plied to the study of different resummation schemes and DIS, but so far has been investigated for simpler 
NLL BFKL kernels and Regge-like configurations. This method has the advantage that it automatically 
provides information about azimuthal decorrelations as well as the pattern of final-state emissions. 

This appproach relies on the fact that, as shown in Ref. [236], it is possible to trade the simple and 
double poles in e, present in D = 4 + 2e dimensional regularisation, by a logarithmic dependence on an 
effective gluon mass A. This A dependence numerically cancels out when the full NLL BFKL evolution is 
taken into account for a given center-of-mass energy, a consequence of the infrared finiteness of the full 
kernel. The introduction of this mass scale, differently to the original work of Ref. [108] was performed 
without angular averaging the NLL kernel. 

With such reguralisation of the infrared divergencies it is then convenient to iterate the NLL BFKL 
equation for the t-channel partial wave, generating, in this way, multiple poles in the complex w-plane. 
The positions of these singularities are set at different values of the gluon Regge trajectory depending 
on the transverse momenta of the Reggeized gluons entering the emission vertices. At this point it is 
possible to Mellin transform back to energy space and obtain an iterated form for the solution of the 
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Fig. 93: Analysis of the gluon Green's function as obtained from the NLL BFKL equation. The plot to the left shows the 
evolution in rapidity of the gluon Green's function at LL and NLL for fixed ka = 25 GeV and kt = 30 GeV. The plot on the 
right hand side shows the dependence on ka for fixed kb — 30 GeV and Y = 10. 



NLL BFKL equation: 

/(k,,k,,Y) = e-o('^")Y <5(2)(k,-k,) 



(5.93) 



n=l i=l 



dyi 
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where the strong ordering in longitudinal components of the parton emission is encoded in the nested 
integrals in rapidity with an upper limit set by the logarithm of the total energy in the process, yo = Y. 
The first term in the expansion corresponds to two Reggeized gluons propagating in the t-channel with 
no additional emissions. The exponentials caiTy the dependence on the Regge gluon trajectory, i.e. 



^0 (q) 



2\2 



/3o , q% q^A 
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+ 
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,(5.94) 



corresponding to no-emission probabilities between two consecutive effective vertices. Meanwhile, the 
real emission is built out of two parts, the first one: 



a. 



vr 
"3 



In 



Nr. 



(5.95) 



which cancels the singularities present in the trajectory order by order in perturbation theory, and the 
second one: ICr, which, although more complicated in structure, does not generate e singularities when 
integrated over the full phase space of the emissions, for details see Ref. [236]. 

The numerical implementation and analysis of the solution as in Eq. (5.93) was performed in 
Ref. [237]. As in previous studies the intercept at NLL was proved to be lower than at leading- 
logarithmic (LL) accuracy. In this approach the kernel is not expanded on a set of functions derived 
from the LL eigenfunctions, and there are no instabilities in energy associated with a choice of functions 
breaking the 7^1 — 7 symmetry, with 7 being the variable Mellin-conjugate of the transverse mo- 
menta. This is explicitly shown at the left hand side of Fig. 93 where the coloured bands correspond to 
uncertainties from the choice of renormalisation scale. Since the exponential growth at NLL is slower 
than at LL, there is little overlap between the two predictions, and furthermore these move apart for 
increasing rapidities. The NLL corrections to the intercept amount to roughly 50% and are stable with 
increasing rapidities. 
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Fig. 94; The 7-representation of the LL and NLL scale invariant kernels together with the coUinearly-improved kernel by an 
tJ-shift and the "all-poles" resummation. 



In transverse momentum space the NLL corrections are stable when the two transverse scales 
entering the forward gluon Green's function are of similar magnitude. However, when the ratio between 
these scales departs largely from unity, the perturbative convergence is poor, driving, as it is well-known, 
the gluon Green's function into an oscillatory behaviour with regions of negative values along the period 
of oscillation. This behaviour is demonstrated in the second plot of Fig 93. 

The way the perturbative expansion of the BFKL kernel is improved by simultaneous resummation 
of energy and collinear logs has been discussed in sections 5.3.1 ,5.3.2. In particular, the original approach 
based on the introduction in the NLL BFKL kernel of an all order resummation of terms compatible 
with renormalisation group evolution described in ref. [230] (and incorporated in the CCSS approach of 
section 5.3.1) can be implemented in the iterative method here explained [238] (the method of ref. [230] 
was combined with the imposition of a veto in rapidities in refs. [239-241]). The main idea is that the 
solution to the u-shift proposed in ref. [230] 



oj = a, 1+ a+— as 2V'(1)-V' T + TT-baJ -V" l-T + TT-ba 



+ «s [xi (7) + (^XO (7) - b) (V'(7) + ^''(l - 7)) 



7r2 



a+y)xo(7)), (5.96) 



can be very accurately approximated by the sum of the approximated solutions to the shift at each of the 
poles in 7 of the LL eigenvalue of the BFKL kernel. This provides an effective "solution" of Eq. (5.96) 
of the form [238] 



^ = asXo(7) +"5X1(7) + 




al I + 



(-l)"(2n)! (a. + aag)"+^ \ 
^Z.2-n!(n + l)!(^ + ^-ba,)2"+iJ 
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7 + m ^\7 + ?7i (7 + m)2 2(7 + 771) 

where xo ^rid xi are, respectively, the LL and NLL scale invariant components of the kernel in 7 repre- 
sentation with the collinear Umit 

^^^^^-7^72 273' 12 A^, 36iV3 36'" 8jVe QNj 12' ^ ' 

The numerical solution to Eq. (5.96) and the value of expression (5.97) ai^e compared in Fig. 94. The 
stability of the perturbative expansion is recovered in all regions of transverse momenta with a prediction 
for the intercept of 0.3 at NLL for = 0.2, a result valid up to the introduction of scale invariance 
breaking terms. The implementation of expression (5.97) in transverse momentum space is simple given 
that the transverse components decouple from the longitudinal in this form of the collinear resumma- 
tion [238]. The prescription is to remove the term — ^ In^ p- from the real emission kernel, tCr \<l-,k\. 
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Fig. 95: The behaviour of the NLL gluon Green's function using the Bessel resummation. 




Fig. 96: Distribution in the number of iterations and angular dependence of the NLL gluon Green's function. 



and replace it with 




2 {as + aa^) In^ — ag — aa^ + ha^^— — — In ^(5.99) 

with Ji the Bessel function of the first kind. This prescription does not affect angular dependences and 
generates a well-behaved gluon Green's function as can be seen in Fig. 95 where the oscillations in 
the coUinear and anticoUinear regions of phase space ai'e consistently removed. At present, work is in 
progress to study the effect of the running of the coupling in this analysis when the Bessel resummation 
is introduced in the iterative procedure of Ref. [236]. 

A great advantage of the iterative method here described is that the solution to the NLL BFKL 
equation is generated integrating the phase space using a Monte Carlo sampling of the different parton 
configurations. This allows for an investigation of the diffusion properties of the BFKL kernel as shown 
in ref. [242], and provides a good handle on the average multiplicities and angular dependences of the 
evolution. Multiplicities can be extracted from the Poisson-like distribution in the number of iterations 
of the kernel needed to reach a convergent solution, which is obtained numerically at the left hand side of 
Fig. 96 for a fixed value of the A parameter. On the right hand side of the figure a study of the azimuthal 
angular correlation of the gluon Green's function is presented at y = 5. This decorrelation will directly 
impact the prediction for the azimuthal angular decorrelation of two jets with a large rapidity separation, 
in a fully inclusive jet sample (i.e. no rapidity gaps). The increase of the angular correlation when the 
NLL terms are included is a characteristic feature of these corrections. This study is possible using this 
approach because the NLL kernel is treated in full, without angular averaging, so there is no need to use 
a Fourier expansion in angular variables. 
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